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OB PROTEIN RECEPTOR AND RELATED COMPOSITIONS AND METHODS 
FTRT,D OF THE INVENTION 

5 The present invention relates to OB protein 

receptors, related compositions and methods of making 
and using such receptors and related compositions. 

BAKKflROPMD 

10 Although the molecular basis for obesity is 

largely unknown, the identification of the "OB gene" and 
protein encoded ("OB protein") has shed some light on 
mechanisms the body uses to regulate body fat deposi- 
tion. Zhang et al., Nature 22Z* 425-432 (1994); see 

15 also , the Correction at Nature 374 : 479 (1995) . The OB 
protein is active in vivo in both ob/ob mutant mice 
(mice obese due to a defect in the production of the OB 
gene product} as well as in normal, wild type mice. The 
biological activity manifests itself in, among other 

20 things, weight loss. See generally ,. Barinaga, "Obese" 

Protein Slims Mice, Science 2f&: 475-476 (1995). £££ PCT 
International Publication Number WO 96/05309, 
"Modulators of Body Weight, Corresponding Nucleic Acids 
and Proteins, and Diagnostic and Therapeutic Uses 

25 Thereof," herein incorporated by reference. 

The other biological effects of OB protein are 
not well characterized. It is known, for instance, that 
in ob/ob mutant mice, administration of OB protein 
results in a decrease in serum insulin levels, and serum 

30 glucose levels. It is also known that administration of 
OB protein results in a decrease in body fat. This was 
observed in both ob/ob mutant mice, as well as non-obese 
normal mice. Pelleymounter et al., Science 2f&: 540-543 
(1995); Halaas et al., Science 2£2: 543-546 (1995). 5fi£ 
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also , Campfield et al., Science 2&: 546-549 (1995) 
(Periph ral and central administration of microgram 
doses of OB protein reduced food intake and body weight 
of oh/ oh and diet- induced obese mice but not in dh/dh 
5 obese mice.) In none of these reports have toxicities 
been observed, even at the highest doses. 

Despite the promise of clinical application 
of the OB protein, the mode of action of the OB protein 
in vivo is not clearly elucidated, in part due to the 

10 absence of information on the OB receptor. High affinity 
binding of the OB protein has been detected in the rat 
hypothalamus, reportedly indicating OB receptor loca- 
tion. Stephens et al., Nature 222: 530-532 (1995). The 
dh/dh mouse displays the identical phenotype as the 

15 oh/oh mouse, Ju£., extreme obesity and Type II diabetes; 
this phenotype is thought to be due to a defective OB 
receptor, particularly since dh/dh mice fail to respond 
to OB protein administration. Sfifi Stephens et al., 
supra . 

20 Identification of the OB protein receptor is 

key in determining the pathway of signal transduction. 
Moreover, identification of the OB protein receptor 
would provide powerful application in diagnostic uses, 
for example, to determine if individuals would benefit 

25 from OB protein therapy. Furthermore, the OB receptor 
could be a key component in an assay for determining 
additional molecules which bind to the receptor and 
result in desired biological activity. Further, such 
soluble receptor could enhance or alter the effective- 

30 ness of OB protein (or analog or derivative thereof) . 

^TTMMARY OF THF. TMVENTIQN 

The present invention relates to a novel class 
of protein receptors, herein denominated "OB protein 
35 receptors" or "OB receptors", which are thought to 
selectively bind OB protein. As such, the novel OB 
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receptor family is provided, as well as novel members of 
such family. Also provided are nucleic acids, vectors 
and host cells containing such nucleic acids, related 
antisense nucleic acids, molecules which selectively 
5 bind to the OB protein receptor, and related composi- 
tions of matter, such as OB receptor protein/OB protein 
complexes. In other aspects, the present invention 
relates to methods of using the above compositions, such 
as therapeutic and/or diagnostic methods, and methods 
10 for preparing OB receptor ligands. 



pF.TATT/RD DESCRIPTION 

A novel family of OB receptors is provided. 
This novel family resulted from identification of a PCR 

15 fragment isolated from a human liver cell cDNA library. 
The original PCR fragment, from which primers were 
isolated, contained a "WSXWS" raotif, common to cytokine 
receptors. As illustrated by the working examples 
below, using this fragment four members of this OB 

20 protein receptor family have been identified. These 

members, herein designated as "A" , -B", and *'C", and "D M 
are indentical at amino acid position 1-891 (using the 
numbering of Seq. ID No. 1), but diverge at position 892 
through the C-terminus. They vary in length at the 

25 C-terminus beyond amino acid 891, and the different 
forms appear to have different tissue distribution. 

Using hydrophobicity analysis, the leader 
sequence is likely to comprise amino acids (Seq. ID. 
No. 1) 1-21, 1-22, or 1-28. The first amino acid of the 

30 mature protein is likely to be 22 (F) , 23 (N) or 29 (T) . 
Most likely* based on analysis of eucaryotic cell 
expression (CHO cell expression see Example 8, tnfra l , 

the first amino acid of the mature protein is 22 (F) . 
The beginning of the transmembrane domain appears to be 
35 located at position 840 (A) or 842 (L) . The end of the 
transmembrane domain appears to be located at position 
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862 (I), 863 (S) or 864 (H) . Thus, based on predictions 
from hydrophobicity analysis, for OB protein binding, at 
a minimum what is needed is the extracellular domain of 
the mature protein, amino acids 22, 23 or 29 through 
5 amino acids 839 (D) or 841 (G) . Therefore, the present 
class of OB receptor proteins includes those having 
' amino acids (according to Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896; 
10 (c) 23-896; 

(d) 29-896; 

(e) 1-839; 

(f) 22-839; 
(h) 1-641; 

15 (i) 22-841; 

(j) 23-841; 

()c) 29-841; 

<1) 1-891; 

(m) 22-891; 

20 (n) 23-891; 

(o) 29-891; 

<p> the amino acids of subparts (1) 

through (o) having the C-terminal amino acids selected 
from among: 

25 (i) OB receptor B {Seq. ID No. 3) 
positions 892-904; 

(ii) OB receptor C (Seq. ID No. 5) 

positions 892- 958; and, 

(iii) OB receptor D (Seq. ID No. 7) 

30 positions 892-1165; 

(q) amino acids of subparts b, c, 

d, f, g, i, j, k, nt, n, o, and any of (p) lacking a 
leader sequence, which have an N-terminal methionyl 
residue. 

35 Also provided herein is what is thought to be 

a human splice variant of a soluble OB receptor. This 
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splice variant includes the extracellular domain at 
least up to amino acid 798 (of Seq. ID No. 1, for 
example) and has a unique 6 amino acid C-terminus at 
positions 799-804: G K F T I L. 
5 The functional domains of the OB receptor may 

be predicted using the information contained in Bazan et 
al., PNAS-USA £7: 6934-6938 (1990) (incorporated herein 
by reference) . For the present OB receptor, there are 
two hematopoietin domains, a random coil region, the 
10 transmembrane domain, and the intracellular domain. The 
overall geography may be illustrated as follows: 




Box3 



Using the information provided by Bazan, 
15 suora . the domains may be predicted, with essentially an 

error of approximately plus or minus three base pairs 
(as applied to all amino acid location specified for 
purposes of identifying the Bazan predicted domains) . 
The precise locations may be determined empirically by 

20 methods known in the art, such as preparing and 

expressing modified recombinant DMAs. The structural 
characteristics are though to be important for 
maintaining the structural integrity of the molecule, 
and therefore, to the extent that such structure is 

25 important for function, for functional characteristics 
as well. 

The hematopoietin domains (HI and H2) are 
thought to have two fibronectin type 3 repeats each, one 
set of paired cysteine residues each (thought to form a 
30 disulfide bridge) , and one "WSXWS box" (referring to the 
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single letter amino acid abbreviation, with "X" being 
any amino acid) . The fibrinectin type 3 domains may be 
identified by location of a double proline <"PP"), which 
marks the beginning of the second fibronectin type 3 
5 repeat; the actual beginning of such second fibronectin 
type 3 repeat is likely to begin about 3 amino acids 
upstream of that double proline. 

The first hematopoietin domain is likely to 
begin at amino acid 123 (using the numbering according 
10 to Seq. ID No. 1, for example), which is an isoleucine 
residue (I) . The last amino acid of the hematopoietin 
domain is likely to be amino acid 339, which is a lysine 
(K) residue. The two fibronectin type 3 repeats are 
likely to be located at (about) amino acids 123 through 
15 235 and 236 through 339. There is a single pair of 

cysteine residues which likely form a disulfide bridge, 
located at position 131 and position 142. The "WSXWS 
box" is located at position 319 through 323. 

The second hematopoietin domain is likely to 
20 begin at position 428, which is an isoleucine (I) and 
end at position 642 which is a glycine (G) . The paired 
fibronectin type 3 repeats are located at about position 
428 through position 535 and about position 536 through 
about position 642. One pair of cysteines is located at 
25 position 436 and position 447, and the second pair is 
located at position 473 and 488. The "WSXWS box" is 
located at position 622-626. 

Between the first and the second hematopoietin 
domain (amino acids 339-428, approximately) is a region 
30 of unknown functional significance. 

The random coil domain ("RC" between the H2 
and the transmembrane domain, "TM") is likely to begin 
at the amino acid following the end of the second 
hematopoietin domain, and is likely to end at the 
35 beginning of the transmembrane domain. This is likely 
to be from about amino acid 642 through amino acid 839 
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or 841 (with the transmembrane domain beginning at 
position 840 (A) or 842 (L) ) . The intracellular domain 
PIC) is likely to begin at position 861 (L) , 862 (I), 
863 (S) or 864 (H) . 
5 The intracellular domain ("IC") contains three 

regions, or "boxes, " thought to participate in signal 
transduction (two "JAK" boxes and a single "STAT" box, 
"Box 1", "Box 2", and "Box 3"). With respect to the 
numbering of the amino acid positions of the "D" form of 

10 the OB receptor (Seq. ZD No. 7, below), box 1 is located 
at amino acid 871 (F) through 878 (P) . Box 2 is located 
at approximately amino acid number 921 (I) through 
931 (K) , Box 3 on the "D" form is located at approxi- 
mately position 1141 through 1144 (amino acids YMPQ, as 

15 the "STAT" box is typically a conserved region of "YXXQ" 
wherein "X" designates any amino acid) . The intracel- 
lular domain is thought to be responsible for signal 
transduction. One possible mode of action is via 
phosphorylation of various residues. See Ihle et al., 

20 Cell M: 331-334 (1996) (Review article, herein . 
incorporated by reference.) 

One possible mode of action is that upon 
ligand binding (here, OB protein binding) , the OB 
receptor dimerizes with another receptor. A kinase 

25 ("JAK") binds to box 1, and becomes phosphorylated. 
(The JAK may already be bound prior to dimerization. ) 
Also, "STATS" bind to box 3 and become phosphorylated on 
a specific tyrosine. It is thought that this 
phosphorylation results, probably indirectly, in DNA 

30 binding protein production, which results in altered DNA 
transcription, and therefore altered expression. As seen 
below in Example 6, one measurement of the capability of 
an OB receptor to transduce signal is the degree of 
phosphorylation of JAK/STAT molecules. 

35 The C-terminus region is intracellular (of 

cell-bound OB receptor) . The differences in the C- 
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terminus among members of the present OB receptor family 
may result in differences in signal transduction among 
the species. Thus, the present OB receptors include at 
least the extracellular domain which is important for OB 
5 protein ligand binding. Nucleic acids encoding the 

present OB receptors, vectors, and host cells are also 

provided for herein. 

The extracellular domain may be modified and 

still retain the function of ligand binding, 
10 particularly by one or more of the following 

modifications: (a) the random coil domain (as indicated 
above, occuring downstream of the second hematopoietic 
domain through the beginning of the transmembrane 
domain) may be deleted (this may be approximately 
15 positions 642 through 839 or 841); (b) the "WSXWS" box 
may be modified by (i) substitution of the first serine 
with another amino acid, particularly conserved in terms 
of hydrophobicity and/or charge, such as a glycine; (il) 
the last serine may be substituted with another amino 
20 acid, such as a threonine; (iii) the first tryptophan 
may be substituted with another amino acid, for example, 
a tyrosine. 

Human genomic DNA encoding OB receptor protein 
is also provided herein. The genomic DNA has been 
25 localized to human chromosome 1P31, which is believed to 
correspond to mouse chromosome 4, the location of the 

mouse db locus. 

Tissue distribution analysis demonstrates the 
presence of OB receptor nucleic acids is fairly 

30 ubiquitous, and particularly noted in the liver. It is 
also observed in the ovary, and heart; and, to a lesser 
extent, in small intestine, lung, skeletal muscle, 
kidney, and, to an even lesser extent, spleen, thymus, 
prostate, testes, placenta and pancreas (Example 2, 

35 below) . There may also be one or more forms of the OB 
receptor present in serum, such as soluble OB receptor. 
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which may be complexed to one or more forms of the OB 
protein. 

Amino Acid Sequences and Compositions 
5 According to the present invention, novel OB 

protein receptors and DNA sequences coding for all or 
part of such OB receptors are provided. The present 
invention provides purified and isolated polypeptide 
products having part or all of the primary structural 

10 conformation (i.e., continuous sequence of amino acid 
residues) and one or more of the biological properties 
(e.g., immunological properties and in vitro biological 
activity) and physical properties (e.g., molecular 
weight) of naturally-occurring mammalian OB receptor 

15 including allelic variants thereof. The term "purified 
and isolated" herein means substantially free of 
unwanted substances so that the present polypeptides are 
useful for an intended purpose. For example, one may 
have a recombinant human OB receptor substantially free 

20 of human proteins or pathological agents. These 

polypeptides are also characterized by being a product 
of mammalian cells, or the product of chemical synthetic 
procedures or of procaryotic or eucaryotic host 
expression (e.g., by bacterial, yeast, higher plant, 

25 insect and mammalian cells in culture) of exogenous DNA 
sequences obtained by genomic or cDNA cloning or by gene 
synthesis. The products of expression in typical yeast 
(e.g., Saceharomyees cerevisiae l . insect, or procaryote 
(e.g., £♦ coli i host cells are free of association with 

30 any mammalian proteins. The products of expression in 
vertebrate (e.g., non-human mammalian (e.g. COS or CHO) 
and avian) cells are free of association with any human 
proteins. Depending upon the host employed, and other 
factors, polypeptides of the invention may be 

35 glycosylated with mammalian or other eucaryotic 

carbohydrates or may be non-glycosylated. One may modify 
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the nucleic acid so that glycosylation sites are 
included in the resultant polypeptide. One may choose to 
partially or fully deglycosylate a glycosylated 
polypeptide. Polypeptides of the invention may also 
5 include an initial methionine amino acid residue (at 

position -1 with respect to the first amino acid residue 
of the mature po.\y peptide) . 

In addition to naturally-occurring allelic 
forms of OB receptor, the present invention also 
10 embraces other OB receptor products such as polypeptide 
analogs of OB receptor and fragments of OB receptor. 
Following the procedures of the above noted published 
application by Alton et al. (WO 83/04053), one can 
readily design and manufacture genes coding for 
15 microbial expression of polypeptides having primary 
conformations which differ from that herein specified 
for in terms of the identity or location of one or more 
residues (e.g., substitutions, terminal and intermediate 
additions and deletions) . Alternately, modifications of 
20 cDNA and genomic genes may be readily accomplished by 
well-known site-directed mutagenesis techniques and 
employed to generate analogs and derivatives of OB 
receptor. Such products would share at least one of the 
biological properties of mammalian OB receptor but may 
25 differ in others. As examples, projected products of 
the invention include those which are foreshortened by 
e.g., deletions; or those which are more stable to 
hydrolysis (and, therefore, may have more pronounced or 
longer lasting effects than naturally-occurring) ; or 
30 which have been altered to delete one or more potential 
sites for glycosylation (which may result in higher 
activities for yeast-produced products); or which have 
one or more cysteine residues deleted or replaced by, 
e.g., alanine or serine residues and are potentially 
35 more easily isolated in active form from microbial 
systems; or which have one or more tyrosine residues 
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replaced by phenylalanine; or have an altered lysine 
composition (such as those prepared for purposes of 
clerivatization) . Included are those polypeptides with 
amino acid substitutions which are "conservative" 
5 according to acidity, charge, hydrophobicity, polarity, 
size or any other characteristic known to those skilled 
in the art. See generally . Creighton, Proteins, W.H. 
Freeman and Company, N.Y., (1984) 498 pp. plus index, 
passim. One may make changes in selected amino acids so 

10 long as such changes preserve the overall folding or 
activity of the protein, (see Table 1, below) . Small 
amino terminal extensions, such as an araino-terminal 
methionine residue, a small linker peptide of up to 
about 20-25 residues, or a small extension that facili- 

15 tates purification, such as a poly-histidine tract, an 
antigenic epitope or a binding domain, may also be 
present. £££., in general Ford et al., Protein Expression 
and Purification 2: 95-107, 1991, which is herein 
incorporated by reference. 
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Table 1 

fannwrvaH w Amino Arid Substitutions 



Basic: 


arginine 

lysine 
histidine 


Acidic: 


glutamic acid 
aspartic acid 


Polar: 


glutaxnine 
asparaqine 


Hydrophobic: 


leucine 
isoleucine 
valine 


Aromatic: 


phenylalanine 
tryptophan 
tyrosine 


Small: 


glycine 
alanine 
serine 
threonine 
methionine 



5 Also comprehended are polypeptide fragments 

duplicating only a part of the continuous amino acid 
sequence or secondary conformations within OB receptor, 
which fragments may possess one activity of mammalian 
(particularly human) OB receptor (e.g., immunological 
10 activity) and not others (e.g., OB protein binding 
activity) . 

Of applicability to OB receptor fragments and 
polypeptide analogs of the invention are reports of the 
immunological activity of synthetic peptides which 
15 substantially duplicate the amino acid sequence extant 
in naturally-occurring proteins, glycoproteins and 
nucleoproteins . More specifically, relatively low 
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molecular weight polypeptides have been shown to 
participate in immune reactions which are similar in 
duration and extent to the immune reactions of 
physiologically significant proteins such as viral 
5 antigens, polypeptide hormones, and the like. Included 
among the immune reactions of such polypeptides is the 
provocation of the formation of specific antibodies in 
immunologically active animals. See , e.g.. Lerner et 
al., Cell 21: 309-310 (1891); Ross et al., Nature 23A: 

10 654-656 (1891); Walter et al., PNAS-USA 21: 5197-5200 
(1980); Lerner et al., PNAS-USA, 2&: 3403-3407 (1891); 
Walter et al., PNAS-USA 2SL: 4882-4886 (1891); Wong et 
al., PNAS-USA 22: 5322-5326 (1982); Baron et al., Cell 
2&: 395-404 (1982); Dressman et al., Nature 225.: 185-160 

15 (1982); and Lerner, Scientific American 248 : 66-74 

(1983) . Sfi£, alsp . Kaiser et al. Science 221: 249-255 

(1984) relating to biological and immunological 
activities of synthetic peptides which approximately 
share secondary structures of peptide hormones but may 

20 not share their primary structural conformation. The 
present invention also includes that class of polypep- 
tides coded for by portions of the DNA complementary to 
the protein-coding strand of the human cDNA or genomic 
DNA sequences of OB receptor i.e., "complementary 

25 inverted proteins" as described by Tramontano et al. 

Nucleic Acid Res. 12.: 5049-5059 (1984) . Polypeptides or 
analogs thereof may also contain one or more amino acid 
analogs, such as peptidomimetics. 

Thus, the present class of OB receptor 

30 proteins includes those having amino acids (according to 
Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896; 

(c) 23-896; 
35 (d) 29-896 

(e) 1-839; 
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(f) 22-839; 

(g) 29-839; 

(h) 1-841; 

(i) 22-841; 
5 (j) 23-841; 

(k) 29-841; 
(1) 1-891; 
(m) 22-891; 
(n) 23-891; 
10 <o) 29-891; 

(p) the amino acids of subparts (1) 
through (o) having the C-terminal amino acid sequence 
beginning at position 892 of OB receptor B (Seq. ID 
No, 3) or C (Seq. ID. No. 5) ; 
15 (q) amino acids of subparts b, c, d, f, 

g, i/ j# m, n, o f and any of (p) lacking a leader 
sequence, which have an N-terminal methionyl residue. 

Also provided is a longer form of an OB 
receptor protein, herein denominated the "D" form, which 
20 has an amino acid sequence selected from among 
(according to Seq. ID No. 7): 

(a) amino acids 1-1165; 

(b) amino acids 22-1165; 

(c) amino acids 23-1165; 
25 (d) amino acids 29-1165; 

(e) amino acids of subparts (b) , (c) or 
(d) having an N-terminal methionyl residue. 

As set forth above, one may prepare soluble 
receptor by elimination of the transmembrane and intra- 
30 cellular regions. Examples of soluble receptors include 
those set forth in Seq. ID Nos. 10 and 13. What is 
thought to be a native, secreted form of a soluble human 
OB receptor is also provided herein. This form of OB 
receptor protein has an amino acid sequence selected 
35 from among (according to Seq. ID No. 13): 

(a) amino acids 1-804; 
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(b) amino acids 22-604; 

(c) amino acids 23-804; 

(d) amino acids 29-804; and, 

(e) amino acids of subparts (b) , (c) or 
5 (d) having an N- terminal methionyl residue. 

In addition, since the C-terminus region of 
the above poly ept ides diverges at position 892 (with 
respect to Seq. ID Nos. 1, 3, 5, 7 and 13) one may 
desire to prepare only the polypeptides which are 
10 divergent: 

(a) those having only amino acids 892-896 
of Seq. ID No. 1; 

<b) those having only amino acids 892-904 
of Seq. ID No. 3; 
15 (c) those having only amino acids 892-958 

of Seq. ID No. 5; 

(d) those having only amino acids 892- 
1165 of Seq. ID No. 7; and, 

(e) those having only amino acids 799-804 
20 of Seq. ID No. 13. 

The above polypeptides which have an 
extracellular domain may be modified, as indicated 
above, and still retain the function of ligand binding. 
Such modification may include one or more of the 
25 following: 

(a) the random coil domain <as indicated 
above, occuring downstream of the second hematopoietic 
domain through the beginning of the transmembrane 
domain) may be deleted (this may be approximately 

30 positions 642 through 839 or 841); 

(b) the "WSXWS" box may be modified by 
(i) substitution of the first serine with another amino 
acid, particularly conserved in terms of hydrophobicity 
and/or charge, such as a glycine; (ii) the last serine 

35 may be substituted with another amino acid, such as a 
threonine; (iii) the first tryptophan may be 
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substituted with another amino acid, for example, a 
tyrosine . 

Thus, the present polypeptides include 
(according to the numbering of Seq. ID No. 7) : 
5 (a) 1-896; 

(b) 22-896; 

(c) 23-896; 
<d) 29-896 
(e) 1-839; 

10 <f) 22-839; 

<g) 29-839; 

(h) 1-841; 

(i) 22-841; 
(j) 23-841; 

15 M 29-841; 

(1) 1-891; 
(m) 22-891; 
<n) 23-891; 
(o) 29-891; 

20 (p) the amino acids of subparts (1) 

through (o) having the C-terrainal amino acids selected 
from the C-terminal amino acids of OB receptor B (Seq. 
ID No. 3), C (Seq. ID. No. 5) and D (Seq ID No. 7); 

(q) the amino acids (according to Seq. ID 

25 No. 13) selected from the group consisting of 22-804; 

23-804 and 29-804; 

(r) amino acids of subparts b, c, d, f, 
g, i, j, kf m, n, o, any of (p) lacking a leader 
sequence, and (q) which have an N-terminal methionyl 
30 residue; and 

(s) amino acids of subparts (a) through 
(r) which above having at least one of the following 
modifications: 

(i) for amino acids of subparts (a) 
35 through (p) and those of subpart (r) which are not amino 
acids according to subpart (q) , deletion of (or 
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substitution of amino acid(s) or other modifications of) 
a random coil domain sequence selected from 

(a) 640 through 839 (using 
the numbering according to Seq. ID No. 1); 

5 (b) 641 through 839; 

<c) 642 through 839; 

(d) 640 through 841; 

(e) 641 through 841; and 
<f> 642 through 841; 

10 (ii) for amino acids of subpart (q) 

and those of subpart (r) which contain the sequence of 
subpart (q) , deletion of of (or substitution of amino 
acid(s) or other modifications of) a random coil domain 
sequence selected from among: 

15 (a) 640 through 804; 

(b) 641 through 804; and, 

(c) 642 through 804; 

and, 

(iii) modification of a "WSXWS" 

20 sequence which is 

(a) substitution of the first 
serine with another amino acid, particularly conserved 
in terms of hydrophobicity and/or charge, such as a 
glycine; 

25 (b) substition of the last 

serine with another amino acid, such as a threonine; 
and 

(c) substitution of the first 
tryptophan with another amino acid, for example, a 
30 tyrosine. 

One may modify the OB receptor to create a 
fusion molecule with other peptide sequence. For 
example, if one desired to "tag" the OB receptor with an 
immunogenic peptide, one could construct a DNA which 
35 would result in such fusion protein. The tag may be at 
the N-terminus. Also, since it is apparent that the 
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C-terminus is not necessary for ligand binding activity, 
one may chemically modify the C-terminus of, for 
example, a soluble OB receptor. One may desire, for 
example, a preparation whereby one or more polymer 
5 molecules such as polyethylene glycol molecules are 

attached. Thus, another aspect of the present invention 
is chemically modified OB receptor protein (also further 
described infra! . 

An example of such "tag" is provided herein 
10 using the C-terminus of a recombinant soluble OB 

receptor. Seq. ID No. 12 provides a "FLAG- tag" version 
of such soluble OB receptor (the nucleic acid sequence 
is provided, which may be transcribed to prepare the 
polypeptide) . Such M FLAG-tag" may also be attached to 
15 the N-terminus or other region of an OB receptor 

protein. This type of "tagging" is useful to bind the 
protein using reagents, such as antibodies, which are 
selective for such tag. Such binding may be for 
detection of the location or amount of protein, or for 
20 protein capturing processes where, for example, an 

affinity column is used to bind the tag, and thus the 
desired protein. Other types of detectable labels, such 
as radioisotopes, light -emitting (e.g., fluorescent or 
phosporescent compounds), enzymatically cleavable, 
25 detectable antibody (or modification thereof) , or other 
substances may be used for such labelling of the present 
proteins. Detecting protein via use of the labels may 
be useful for identifying the presence or amount of OB 
receptor protein or a compound containing such protein 
30 (e.g., OB protein complexed to OB receptor) . Moreover, 
such labelled protein may be useful for distinguishing 
exogenous OB receptor protein from the endogenous form. 



\ 
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Nucleic Acids 

Novel nucleic acid sequences of the invention 
include sequences useful in securing expression in 
5 procaryotic or eucaryotic host cells of polypeptide 
products having at least a part of the primary struc- 
tural conformation and one or more of the biological 
properties of recombinant human OB receptor. The 
nucleic acids may be purified and isolated, so that the 

10 desired coding region is useful to produce the present 
polypeptides, for example, or for diagnostic purposes, 
as described more fully below. DNA sequences of the 
invention specifically comprise: (a) any of the DNA 
sequences set forth in Seq. ID No. 2, 4, 6, 8, 9, 11, 

15 12, and 14 (and complementary strands); (b) a DNA 
sequence which hybridizes (under hybridization 
conditions disclosed in the cDNA library screening 
section below, using the 300 bp PCR fragment as 
described to selectively hybridize to a cDNA encoding an 

20 OB receptor protein in a human liver cDNA library, or 
equivalent conditions or more stringent conditions) to 
the DNA sequence in subpart (a) or to fragments thereof; 
and (c) a DNA sequence which, but for the degeneracy of 
the genetic code, would hybridize to the DNA sequence in 

25 subpart (a) . Specifically comprehended in parts (b) and 
(c) are genomic DNA sequences encoding allelic variant 
forms of human OB receptor and/or encoding OB receptor 
from other mammalian species, and manufactured DNA 
sequences encoding OB receptor, fragments of OB 

30 receptor, and analogs of OB receptor which DNA sequences 
may incorporate codons facilitating transcription and 
translation of messenger RNA in microbial hosts. Such 
manufactured sequences may readily be constructed 
according to the methods of Alton et al., PCT published 

35 application WO 83/04053* 
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Genomic DNA, such as that of Seq. ID No. 9, 
encoding the present OB receptors may contain additional 
non-coding bases, or introns, and such genomic DNAs are 
obtainable by hybridizing all or part of the cDNA, 
5 illustrated in Seq. ID Nos. 2,. 4, 6, 8, 11, and 14 to a 
genomic DNA source, such as a human genomic DNA library. 
Such genomic DNA will encode functional OB receptor 
polypeptide; however, use of the cDNAs may be more 
practicable in that, since only the coding region is 
10 involved, recombinant manipulation is facilitated. The 
intron/exon location of genomic DNA is set forth in Seq. 
ID No. 9, infra. 

Nucleic acid sequences include the 
incorporation of codons which enhance expression by 
15 selected nonmammalian hosts; the provision of sites for 
cleavage by restriction endonuclease enzymes; and the 
provision of additional initial, terminal or 
intermediate DNA sequences which facilitate construction 
of cloning and/or expression vectors. 
20 The present invention also provides DNA 

sequences coding for polypeptide analogs or derivatives 
of OB receptor which differ from naturally-occurring 
forms in terms as described above. The leader sequence 
DNA may be substituted with another leader sequence for 
25 ease in expression or for other purposes. 

Also, one may prepare antisense nucleic acids 
against the present DNAs. Such antisense nucleic acids 
may be useful in modulating the effects of OB receptor 
protein in vivo . For example, one may prepare an 
30 antisense nucleic acid which effectively disables the 
ability of a cell to produce OB receptor by binding to 
the nucleic acid which encodes such OB receptor. 

DNA sequences of the invention are also 
suitable materials for use as labeled probes in 
35 isolating human genomic DNA encoding OB receptor, as 
mentioned above, and related proteins as well as cDNA 
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and genomic DNA sequences of other mammalian species. 
DNA sequences may also be useful in various alternative 
methods of protein synthesis (e.g., in insect cells) or, 
as described infra , in genetic therapy in humans and 
5 other mammals. DNA sequences of the invention are 

expected to be useful in developing transgenic mammalian 
species which may serve as eucaryotic "hosts" for 
production of OB receptor and OB receptor products in 
quantity. See, generally, Palmiter et al., Science 222 * 

10 809-814 (1983) . 

Victors and Host Cells 

According to another aspect of the present 
invention, the DNA sequences described herein which 

15 encode OB receptor polypeptides are valuable for the 

information which they provide concerning the amino acid 
sequence of the mammalian protein which have heretofore 
been unavailable. Put another way, DNA sequences 
provided by the invention are useful in generating new 

20 and useful viral and circular plasmid DNA vectors, new 
and useful transformed and transfected procaryotic and 
eucaryotic host cells (including bacterial cells, yeast 
cells, insect cells, and mammalian cells grown in 
culture) , and new and useful methods for cultured growth 

25 of such host cells capable of expression of OB receptor 
and its related products. 

The DNA provided herein (or corresponding 
RNAs) may also be used for gene therapy for, example, 
treatment of conditions characterized by the 

30 overexpression of OB protein, such as anorexia or 

cachexia. Alternatively, gene therapy may be used in 
cases where increased sensitivity to OB protein is 
desired, such as in cases where an individual has a 
condition characterized by OB protein receptors 

35 defective in ability to bind or retain the binding of OB 
protein. Currently, vectors suitable for gene therapy 
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(such as retroviral or adenoviral vectors modified for 
gene therapy purposes and of purity and pharmaceutical 
acceptability) may be administered for delivery into the 
lung, for example. Such vectors may incorporate nucleic 
5 acid encoding the present polypeptides for expression in 
a desired location. Gene therapy may involve more than 
one gene for a desired protein or different desired 
proteins. 

Alternatively, one may use no vector so as to 
10 facilitate relatively stable presence in the host. For 
example, homologous recombination of a DNA as provided 
herein or of a suitable transcription or translation 
control region may facilitate integration into or 
expression from a host genome. (This may be performed 
15 for production purposes as well, e.g., U.S. Patent 

No. 5,272,071 and WO 91/09955.) The nucleic acid may be 
placed within a pharmaceutical ly acceptable carrier to 
facilitate cellular uptake, such as a lipid solution 
carrier (e.g., a charged lipid), a liposome, or 
20 polypeptide carrier (e.g., polylysine) . A review 

article on gene therapy is Verma, Scientific American, 
November 1990, pages 68-84 which is herein incorporated 

by reference. 

Thus, the present invention provides for a 

25 population of cells expressing an OB receptor of the 

present OB receptor family. Such cells are suitable for 
transplantation or implantation into an individual for 
therapeutic purposes. For example, one may prepare a 
population of cells to overexpress OB receptor (such as 

30 one identified in the Sequence ID'S or otherwise denoted 
herein) , or to express a desired form of OB receptor, 
such as one which is particularly sensitive to OB 
protein (i.e., a form which has a desired capacity for 
signal transduction) . One may then implant such cells 

35 into an individual to increase that individual's 

sensitivity to OB protein. Such cells may, for example, 
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be liver cells, bone marrow cells, or cells derived from 
umbillical cord. Alternatively, one may wish to use 
overexpressing circulating cells such as blood 
progenitor cells, T cells or other blood cells. For 
5 humans, human cells may be used. Cells may be in the 
form of tissue. Such cells may be cultured prior to 
transplantation or implantation. Such OB receptor 
overexpression, or expression of particularly sensitive 
forms of OB receptor may be accomplished by, for 

10 example, altering the regulatory mechanism for 

expression of OB receptor, such as using homologous 
recombination techniques as described supra . Thus, 
provided is a population of host cells modified so that 
expression of endogenous OB receptor DNA is enhanced. 

15 The cells to be transferred to the recipient 

may be cultured using one or more factors affecting the 
growth or proliferation of such cells if appropriate. 
Hematopoietic factors may be used in culturing 
hematopoietic cells. Such factors include G-CSF, EPO, 

20 MGDF, SCF, Flt-3 ligand, interleukins (e.g., IL1-IL13) , 
GM-CSF, LIF, and analogs and derivatives thereof as 
available to one skilled in the art. 

Nerve cells, such as neurons or glia, may also 
be used, and these may be cultured with neurotrophic 

25 factors such as BDNF, CNTF, GDNF, NT3, or others. 

There may be a co-gene therapy involving the 
transplantation of cells expressing more than one 
desired protein. For example, cells expressing OB 
receptor protein may be used in conjunction, 

30 simultaneously or in serriatim with cells expressing OB 
protein « 

For gene therapy dosages, one will generally 
use between one copy and several thousand copies of the 
present nucleic acid per cell, depending on the vector, 
35 the expression system, the age, weight and condition of 
the recipient and other factors which will be apparent 
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to those skilled in the art. The cellular delivery of 
such protein may be designed to last for a selected 
period of time, such as a period of days, weeks, months 
or years. At the end of the effective time period, the 
5 recipient of such transformed cells may receive another 
"dose" <e.g., transplantation of cells). Cells may be 
selected for their lifespan, their time period of 
expression of the desired protein, or their ability to 
be reisolated from an individual (i.e., for blood cells, 
10 leukaphoresis may be used to retrieve transformed cells 
using markers present on the cell surface) . Vectors may 
be similiarly designed using, for example, viruses which 
have a known period of expression of DNAs contained 
therein . 

15 The desired cells or vectors may be stored 

using techniques, such as freezing, available to those 
in the art. 

Thus, the present invention also contemplates 
a method for administering OB receptor protein to an 
20 individual, wherein the source of said OB receptor 
protein is selected from (i) a population of cells 
expressing OB receptor protein and (ii) a population of 
vectors expressing OB receptor protein. Said OB 
receptor protein may be selected from among those 
25 described herein. Said vectors may be virus vectors 
capable of infecting human cells. Said cells may be 
selected from among tissue or individual cells. Said 
individual cells may be selected from among adipocytes, 
fibroblasts, bone marrow cells, peripheral blood 
30 progenitor cells, red blood cells, and white blood 
cells, including T cells and nerve cells. Said 
population of cells or vectors may be co-administered 
with a population of cells or vectors which express OB 
protein or another desired protein. Said cells or 
35 vectors may be stored for use in an individual. Storage 
may be by freezing \ 
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be liver cells, bone marrow cells, or cells derived from 
umbillical cord. Alternatively, one may wish to use 
overexpressing circulating cells such as blood 
progenitor cells, T cells or other blood cells. For 
5 humans, human cells may be used. Cells may be in the 
form of tissue. Such cells may be cultured prior to 
transplantation or implantation. Such OB receptor 
over expression, or expression of particularly sensitive 
forms of OB receptor may be accomplished by, for 

10 example, altering the regulatory mechanism for 

expression of OB receptor, such as using homologous 
recombination techniques as described supra . Thus, 
provided is a population of host cells modified so that 
expression of endogenous OB receptor DNA is enhanced. 

15 The cells to be transferred to the recipient 

may be cultured using one or more factors affecting the 
growth or proliferation of such cells if appropriate. 
Hematopoietic factors may be used in culturing 
hematopoietic cells. Such factors include G-CSF, EPO, 

20 MGDF, SCF, Flt-3 ligand, interleukins (e.g., IL1-IL13) , 
GM-CSF, LIF, and analogs and derivatives thereof as 
available to one skilled in the art. 

Nerve cells, such as neurons or glia, may also 
be used, and these may be cultured with neurotrophic 

25 factors such as BDNF, CNTF, GDNF, NT3, or others. 

There may be a co-gene therapy involving the 
transplantation of cells expressing more than one 
desired protein. For example, cells expressing OB 
receptor protein may be used in conjunction, 

30 simultaneously or In serriatim with cells expressing OB 
protein . 

For gene therapy dosages, one will generally 
use between one copy and several thousand copies of the 
present nucleic acid per cell, depending on the vector, 
35 the expression system, the age, weight and condition of 
the recipient and other factors which will be apparent 



WO 97/254X4 



PCT/US97/00128 



- 24 - 

to those skilled in the art. The cellular delivery of 
such protein may be designed to last for a selected 
period of time, such as a period of days, weeks, months 
or years. At the end of the effective time period, the 
5 recipient of such transformed cells may receive another 
"dose" (e.g., transplantation of cells). Cells may be 
selected for their lifespan, their time period of 
expression of the desired protein, or their ability to 
be reisolated from an individual (i.e., for blood cells, 
10 leukaphoresis may be used to retrieve transformed cells 
using markers present on the cell surface) . Vectors may 
be similiarly designed using, for example, viruses which 
have a known period of expression of DNAs contained 
therein. 

15 The desired cells or vectors may be stored 

using techniques, such as freezing, available to those 
in the art. 

Thus, the present invention also contemplates 
a method for administering OB receptor protein to an 

20 individual, wherein the source of said OB receptor 
protein is selected from (i) a population of cells 
expressing OB receptor protein and (ii) a population of 
vectors expressing OB receptor protein. Said OB 
receptor protein may be selected from among those 

25 described herein. Said vectors may be virus vectors 
capable of infecting human cells. Said cells may be 
selected from among tissue or individual cells. Said 
individual cells may be selected from among adipocytes, 
fibroblasts, bone marrow cells, peripheral blood 

30 progenitor cells, red blood cells, and white blood 
cells, including T cells and nerve cells. Said 
population of cells or vectors may be co-administered 
with a population of cells or vectors which express OB 
protein or another desired protein. Said cells or 

35 vectors may be stored for use in an individual. Storage 
may be by freezing 
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Complexes 

In addition to the OB receptor protein as 
described herein, one may prepare complexes of OB 
5 receptor protein and OB protein, analog or derivative. 

The OB protein may be selected from those 
described in PCT publication WO 96/05309, above and 
hereby incorporated by reference in its entirety. 
Figure 3 of that publication (Seq. ID No. 4, as cited 

10 therein) depicts the full deduced amino acid sequence 
derived for the human OB gene. The amino acids are 
numbered from 1 to 167. A signal sequence cleavage site 
is located after amino acid 21 {Ala) so that the mature 
protein extends from amino acid 22 (Val) to amino acid 

15 167 (Cys) . For the present disclosure, a different 

numbering is used herein, where the amino acid position 
1 is the Valine residue which is at the beginning of the 
mature protein. 

Generally, the OB protein for use will be 

20 capable of complexing to the OB protein receptor 

selected. Thus, one may empirically test the binding 
capability (to all or part of the extracellular domain 
of the OB receptor as indicated above) to determine 
which OB protein forms may be used. Generally, 

25 modifications generally applicable as indicated above 
for OB receptor protein may also be applied here, and 
that disclosure is incorporated by reference here. As 
set forth in WO 96 05309, OB protein in its native form, 
or fragments (such as enzyme cleavage products) or other 

30 truncated forms, analogs, and derivatives all retain 

biological activity. Such forms may be used so long as 
the form binds to at least a portion of the 
extracellular domain of the present OB receptor 
proteins . 

35 An effective amount of an OB protein, analog 

or derivative thereof may be selected from among 
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according to the amino acid sequence as presented in PCT 
WO 96/05309, Figure 3 numbered so that the first amino 
acid of the mature protein is number 1: 

(a) the amino acid sequence 1-146, 

5 optionally lacking a glutaminyl residue at position 28, 
and further optionally having a methionyl residue at the 
N-terminus; 

(b) an amino acid sequence of subpart 
(a) having a different amino acid substituted in one or 

10 more of the following positions: 4, 8, 32, 33, 35, 48, 
50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 78, 89, 97, 
100, 102, 105, 106, 107, 108, 111, 112, 118, 136, 138, 
142, and 145; 

(c) a truncated OB protein analog 

15 selected from among: (using the numbering of subpart (a) 
above) : 

(i) amino acids 98-146 
(ii) amino acids 1-32 
(iii) amino acids 1-35 
20 (iv) amino acids 40-116 

(v) amino acids 1-99 and 112-146 
(vi) amino acids 1-99 and 112-146 
having one or more of amino acids 100-111 
sequentially placed between amino acids 99 and 112; 
25 and, 

(vii) the truncated OB analog of 
subpart (i) having one or more of amino acids 100, 
102, 105, 106, 107, 108, 111, 112, 118, 136, 138, 
142, and 145 substituted with another amino acid; 

30 (viii) the truncated analog of subpart 

(ii) having one or more of amino acids 4, 8 and 32 
substituted with another amino acid; 

(ix) the truncated analog of subpart 
(iv) having one or more of amino acids 50, 53, 60, 

35 64, 66, 67, 68, 71, 74, 77, 78, 89, 97, 100, 102, 
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105, 106, 107, 108, 111 and 112 replaced with 
another amino acid; 

(x) the truncated analog of subpart 
<v> having one or more of amino acids 4, 8, 32, 33, 
5 35, 48, 50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 

78, 89, 97, 112, 118, 136, 138, 142, and 145 
replaced with another amino acid; 

(xi) the truncated analog of subpart 
<vi) having one or more of amino acids 4, 8,32, 33, 
10 35, 48, 50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 

78, 89, 97, 100, 102, 105, 106, 107, 108, 111, 
112, 118, 136, 138, 142, and 145 replaced with 
another amino acid; 

(xii) the truncated analog of any of 
15 subparts (i)-(xi) having an N-terminal methionyl 

residue; and 

(d) the OB protein or analog derivative 
of any of subparts (a) through {c} comprised of a 
chemical moiety connected to the protein moiety; 
20 (e) a derivative of subpart (d) wherein 

said chemical moiety is a water soluble polymer moiety; 

(f) a derivative of subpart (e) wherein 
said water soluble polymer moiety is polyethylene 
glycol; 

25 (g) A derivative of subpart {f) wherein 

said water soluble polymer moiety is a polyamino acid 
moiety; 

(h) a derivative of subpart (g) wherein 
said water soluble polymer moiety is attached at solely 

30 the N-terminus of said protein moiety; 

(i) an OB protein, analog or derivative 
of any of subparts (a) through (h) in a pharmaceutically 
acceptable carrier. 

OB proteins, analogs and related molecules are 
35 also reported in the following publications; however, no 



r 
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representation is made with regard to the activity of 
any composition reported: 

U.S. Patent Nos. 5,521,263; 5,532,336; 
5,552,522; 5,552,523; 5,552,524; 5,554,727; 
5 5,559,208; 5,563,243; 5,563,244; 5,563,245; 

5,567,678; 5,567,803; 5,569,744; 5,569,743 

(all assigned to Eli Lilly and Company) ; 
PCT W096/23517; W096/23515; W096/23514; 
WO96/24670; W096/23513; W096/23516; 
10 W096/23518; W096/23519; WO96/23520; 

W096/23815; WO96/24670; W096/27385 (all 
assigned to Eli Lilly and Company); 
PCT WO96/22308 (assigned to Zymogenetics) ; 
PCT WO96/29405 (assigned to Ligand 
15 Pharmaceuticals, Inc.); 

PCT W096/31526 (assigned to Amyin 

Pharmaceuticals, Inc. ) ; 
PCT W096/34885 (assigned to Smithkline Beecham 
PLC) ; 

20 PCT W096/35787 (assigned to Chiron); 

EP 0 725 079. (assigned to Eli Lilly and 
Company) ; 

EP 0 725 078 (assigned to Eli Lilly and 
Company) ; 

25 EP 0 736 599 (assigned to Takeda) ; 

EP 0 741 187 (assigned to F. Hoffman LaRoche) . 



To the extent these references provide for 
useful OB proteins or analogs or derivatives thereof, or 

30 associated compositions or methods, such compositions 
and/or methods may be used in conjunction with the 
present OB receptor proteins, such as for co- 
administration (together or separately, in a selected 
dosage schedule) or by complex ing compositions to the 

35 present OB protein receptors. With the above provisos, 
these publications are herein incorporated by reference. 
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Derivatives and Formulations 

The present OB protein receptor and/or OB 
protein (herein the term "protein" is used to include 
5 "peptide" and OB protein or receptor analogs, such as 
those recited infra , unless otherwise indicated) may 

also be derivatized by the attachment of one or more 
chemical moieties to the protein moiety. If the present 
pharmaceutical compositions contain as the active 

10 ingredient a complex of OB protein receptor and OB 
protein, one or both of such proteins may be 
derivatized. The chemically modified derivatives may be 
further formulated for intraarterial, intraperitoneal, 
intramuscular, subcutaneous, intravenous, oral, nasal, 

15 pulmonary, topical or other routes of administration. 
Chemical modification of biologically active proteins 
has been found to provide additional advantages under 
certain circumstances, such as increasing the stability 
and circulation time of the therapeutic protein and 

20 decreasing immunogenicity . S^e U.S. Patent 

No. 4,179,337, Davis et al., issued December 18, 1979. 
For a review, see Abuchowski et al., In Enzymes as 
Drugs. (J.S. Holcerberg and J. Roberts, eds. 
pp. 367-383 (1891)). A review article describing 

25 protein modification and fusion proteins is Francis, 

Focus on Growth Factors 3: 4-10 (May 1992) (published by 
Mediscript, Mount view Court, Friern Barnet Lane, London 
N20, OLD, UK) . 

Preferably, for therapeutic use of the 

30 end-product preparation, the chemical moiety for 

derivatization will be pharmaceutical ly acceptable. A 
polymer may be used. One skilled in the art will be 
able to select the desired polymer based on such 
considerations as whether the polymer/protein conjugate 

35 will be used therapeutically, and if so, the desired 

dosage, circulation time, resistance to proteolysis, and 
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other considerations. For the present proteins and 
peptides, the effectiveness of the derivatization may be 
ascertained by administering the derivative, in the 
desired form (i.e., by osmotic pump, or by injection or 
5 infusion, or, further formulated for oral, pulmonary or 
nasal delivery, for example) , and observing biological 
effects as described herein. 

The chemical moieties suitable for 
derivatization may be selected from among various water 
10 soluble polymers. The polymer selected should be water 
soluble so that the protein to which it is attached so 
that it is miscible in an aqueous environment, such as a 
physiological environment. The water soluble polymer 
may be selected from the group consisting of, for 
15 example, polyethylene glycol, copolymers of ethylene 
glycol /propylene glycol, carboxymethylcellulose, 
dextran, polyvinyl alcohol, polyvinyl pyrolidone, 
poly-1, 3-dioxolane, poly-1, 3, 6-trioxane, 
ethylene/maleic anhydride copolymer, polyaminoacids 
20 (either homopolymers or random or non-random copolymers 
f See supra regarding fusion molecules) , and dextran or 
polytn-vinyl pyrolidone) poly ethylene glycol, propylene 
glycol homopolymers, polypropylene oxide/ethylene oxide 
co-polymers, polyoxyethylated polyols, 
25 polystyrenemaleate and polyvinyl alcohol. Polyethylene 
glycol propionaldenhyde may have advantages in 
manufacturing due to its stability in water. 

Fusion proteins may be prepared by attaching 
polyaminoacids to the OB protein receptor or OB protein 
30 (or analog or complex) moiety. For example, the 

polyamino acid may be a carrier protein which serves to 
Increase the circulation half life of the protein. For 
the present therapeutic or cosmetic purposes, such 
polyamino acid should be those which do not create 
35 neutralizing antigenic response, or other adverse 

response. Such polyamino acid may be selected from the 
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group consisting of serum album (such as human serum 
albumin) , an antibody or portion thereof (such as an 
antibody constant region, sometimes called "Fc") or 

other polyamino acids. As indicated below, the location 
5 of attachment of the polyamino. acid may be at the 

N-terminus of the OB protein moiety, or other place, and 
also may be connected by a chemical "linker" moiety to 
the OB protein. 

The polymer may be of any molecular weight, 

10 and may be branched or unbranched. For polyethylene 

glycol, the preferred molecular weight is between about 
2 kDa and about 100 kDa (the term "about" indicating 
that in preparations of polyethylene glycol, some 
molecules will weigh more, some less, than the stated 

15 molecular weight) for ease in handling and 

manufacturing. Other sizes may be used, depending on 
the desired therapeutic profile (e.g., the duration of 
sustained release desired, the effects, if any on 
biological activity, the ease in handling, the degree or 

20 lack of antigenicity and other known effects of the 

polyethylene glycol to a therapeutic protein or analog) . 

The number of polymer molecules so attached 
may vary, and one skilled in the art will be able to 
ascertain the effect on function. One may mono- 

25 derivatize, or may provide for a di-, tri-, tetra- or 
some combination of derivatization, with the same or 
different chemical moieties (e.g., polymers, such as 
different weights of polyethylene glycols) . The 
proportion of polymer molecules to protein (or peptide) 

30 molecules will vary, as will their concentrations in the 
reaction mixture. In general, the optimum ratio (in 
terms of efficiency of reaction in that there is no 
excess un re acted protein or polymer) will be determined 
by factors such as the desired degree of derivatization 

35 (e.g., mono, di-, tri-, etc.), the molecular weight of 
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the polymer selected, whether the polymer is branched or 
unbranched, and the reaction conditions. 

The chemical moieties should be attached to 
the protein with consideration of effects on functional 
5 or antigenic domains of the protein. There are a number 
of attachment methods available to those skilled in the 
art. E.g. , EP 0 401 384 herein incorporated by reference 
(coupling PEG to G-CSF), fififi also Malik et al., Exp. 
Hematol. 2Q.'< 1028-1035 (1992) (reporting pegylation of 
10 GM-CSF using tresyl chloride) . For example, 

polyethylene glycol may be covalently bound through 
amino acid residues via a reactive group, such as, a 
free amino or carboxyl group. Reactive groups are those 
to which an activated polyethylene glycol molecule (or 
15 other chemical moiety) may be bound. The amino acid 
residues having a free amino group may include lysine 
residues and the K-terminal amino acid residue. Those 
having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues, and the C-terminal 
20 amino acid residue. Sulfhydrl groups may also be used 
as a reactive group for attaching the polyethylene 
glycol molecule is) (or other chemical moiety). 
Preferred for therapeutic manufacturing purposes is 
attachment at an amino group, such as attachment at the 
25 N-terminus or lysine group. Attachment at residues 
important for receptor binding should be avoided if 
receptor binding is desired. 

One may specifically desire N-terminally 
chemically modified protein. Using polyethylene glycol 
30 as an illustration of the present compositions, one may 
select from a variety of polyethylene glycol molecules 
(by molecular weight, branching, etc.), the proportion 
of polyethylene glycol molecules to protein molecules in 
the reaction mix, the type of pegylation reaction to be 
35 performed, and the method of obtaining the selected 

N-terminally pegylated protein. The method of obtaining 
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the N-terminally pegylated preparation (i.e., 
separating this moiety from other monopegylated moieties 
if necessary) may be by purification of the N-terminally 
pegylated material from a population of pegylated 
5 protein molecules. Selective N-terminal chemical 

modification may be accomplished by reductive alkylation 
which exploits differential reactivity of different 
types of primary amino groups (lysine versus the 
N-terminal) available for derivatization in a particular 

10 protein. £££ PCT WO 96/11953, herein incorporated by 
reference. Under the appropriate reaction conditions, 
substantially selective derivatization of the protein at 
the N-terminus with a carbonyl group containing polymer 
is achieved. For example, one may selectively 

15 N-terminally pegylate the protein by performing the 

reaction at a pH which allows one to take advantage of 
the pK a differences between the e-amino group of the 

lysine residues and that of the a-amino group of the 
N-terminal residue of the protein. By such selective 

20 derivatization, attachment of a polymer to a protein is 
controlled: the conjugation with the polymer takes 
place predominantly at the N-terminus of the protein and 
no significant modification of other reactive groups, 
such as the lysine side chain amino groups, occurs. 

25 Using reductive alkylation, the polymer may be of the 
type described above, and should have a single reactive 
aldehyde for coupling to the protein. Polyethylene 
glycol prop ion aldehyde, containing a single reactive 
aldehyde, may be used. 

30 An N-terminally chemically modified derivative 

is preferred (over other forms of chemical modification) 
for ease in production of a therapeutic. N-terminal 
chemical modification ensures a homogenous product as 
characterization of the product is simplified relative 

35 to di-, tri- or other multi-derivatized products. The 

use of the above reductive alkylation process for 

\ 

\ 
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preparation of an N-terminally chemically modified 
product is preferred for ease in commercial 
manufacturing. 

In yet another aspect of the present 
5 invention, provided are methods of using pharmaceutical 
compositions of the proteins, and derivatives. Such 
pharmaceutical compositions may be for administration by 
injection, or for oral, pulmonary, nasal, transdermal or 
other forms of administration. In general, comprehended 
10 by the invention are pharmaceutical compositions 

comprising effective amounts of protein or derivative 
products of the invention together with pharmaceutical ly 
acceptable diluents, preservatives, solubilizers, 
emulsifiers, adjuvants and/or carriers. Such 
15 compositions include diluents of various buffer content 
(e.g., Tris-HCl, acetate, phosphate), pH and ionic 
strength; additives such as detergents and solubilizing 
agents (e.g., Tween 80, Polysorbate 80), anti-oxidants 
(e.g., ascorbic acid, sodium metabi sulfite) , 
20 preservatives (e.g., Thimersol, benzyl alcohol) and 
bulking substances (e.g., lactose, mannitol) ; 
incorporation of the material into particulate 
preparations of polymeric compounds such as polylactic 
acid, polyglycolic acid, etc. or into liposomes. Sfifi, 
25 e.g. . PCT W096/29989, Collins et al., "Stable protein: 
phospholipid compositions and methods," published 
October 3, 1996, herein incorporated by reference. 
Hylauronic acid may also be used, and this may have the 
effect of promoting sustained duration in the 
30 circulation. Such compositions may influence the 

physical state, stability, rate of in Yly^ release, and 
rate of in vivo clearance of the present proteins and 
derivatives. Sfi£, e^g., Remington's Pharmaceutical 
Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, 
35 PA 18042) pages 1435-1712 which are herein incorporated 
by reference. The compositions may be prepared in 
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lyophilized form. Implantable sustained release 
formulations are also contemplated, as are transdermal 
formulations. 

5 Specifically contemplated are oral dosage 

forms of the above derivatized proteins. Protein may be 
chemically modified so that oral delivery of the 
derivative is efficacious. Generally, the chemical 
modification contemplated is the attachment of at least 

10 one moiety to the protein (or peptide) molecule itself, 
where said moiety permits (a) inhibition of proteolysis; 
and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall 
stability of the protein and increase in circulation 

15 time in the body. ,£££ PCT W095/21629, Habberfield, 
"Oral Delivery of Chemically Modified Proteins" 
(published August 17, 1995) herein incorporated by 
reference, and U.S. Patent No. 5,574,018, Habberfield et 
al., "Conjugates of Vitamin B12 and Proteins," issued 

20 November 12, 1996, herein incorporated by reference. 

Also contemplated herein is pulmonary delivery 
of the present protein, or derivative thereof. The 
protein (derivative) is delivered to the lungs of a 
mammal while inhaling and traverses across the lung 

25 epithelial lining to the blood stream. See, PCT 

WO94/20069, Niven et al. f ."Pulmonary administration of 
granulocyte colony stimulating factor," published 
September 15, 1994, herein incorporated by reference. 

Nasal delivery of the protein (or analog or 

30 derivative) is also contemplated. Nasal delivery allows 
the passage of the protein to the blood stream directly 
after administering the therapeutic product to the nose, 
without the necessity for deposition of the product in 
the lung. Formulations for nasal delivery include those 

35 with absorption enhancing agent s, such as dextran or 
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cyclodextran. Delivery via transport across other 
raucous membranes is also contemplated. 

Dosages 

5 One skilled in the art will be able to 

ascertain effective dosages by administration and 
observing the desired therapeutic effect. Preferably, 
the formulation of the molecule or complex in a 
pharmaceutical composition will be such that between 
10 about .10 ug/kg/day and 10 mg/kg/day will yield the 

desired therapeutic effect. The effective dosages may 
be determined using diagnostic tools over time. For 
example, a diagnostic for measuring the amount of OB 
protein or OB receptor protein in the blood (or plasma 
15 or serum) may first be used to determine endogenous 
levels of OB protein (or receptor) . Such diagnostic 
tool may be in the form of an antibody assay, such as an 
antibody sandwich assay. The amount of endogenous OB 
receptor protein (such as soluble receptor) is 
20 quantified initially, and a baseline is determined. The 
therapeutic dosages are determined as the quantification 
of endogenous and exogenous OB receptor protein (that 
is, protein, analog or derivative found within the body, 
either self-produced or administered) is continued over 
25 the course of therapy. The dosages may therefore vary 
over the course of therapy, with a relatively high 
dosage being used initially, until therapeutic benefit 
is seen, and lower dosages used to maintain the 
therapeutic benefits. 
30 During an initial course of therapy of an 

obese person, dosages may be administered whereby weight 
loss and concomitant fat tissue decrease increase is 
achieved. Once sufficient weight loss is achieved, a 
dosage sufficient to prevent re-gaining weight, yet 
35 sufficient to maintain desired weight or fat mass may be 
administered. These dosages can be determined 
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empirically, as the effects of OB protein are 
reversible. E.g. . Carapfield et al., Science 269 : 546-549 
(1995) at 547. Thus, if a dosage resulting in weight 
loss is observed when weight loss is not desired, one 
5 would administer a lower dose, yet maintain the desired 
weight . 

Therapeutic Compositions and Methods 

The present OB receptor proteins, alone, or in 

10 combination with an OB protein, and nucleic acids may be 
used for methods of treatment, or for methods of 
manufacturing medicaments for treatment. Such treatment 
includes conditions characterized by excessive 
production of OB protein, wherein the present OB 

15 receptors, particularly in soluble form, may be used to 
complex to and therefore inactivate such excessive OB 
protein. Or, such OB receptor protein, particularly in 
soluble form, may act to protect the activity of OB 
protein. While not wishing to be bound by theory, one 

20 may postulate that OB protein receptor agonist activity 
may be accomplished by a protective effect achieved when 
OB protein receptor (particularly soluble receptor) is 
complexed to OB protein. Such effect may prolong the 
serum half life of OB protein in vivo . Such treatments 

25 may be accomplished by preparing soluble receptor (e.g., 
use of an extracellular domain as described supxa) and 
administering such composition to an individual in need 
thereof or by preparation of a population of cells 
containing or expressing such OB receptor, and 

30 transplanting such cells into the individual in need 
thereof. 

The present OB receptors may also be used for 
treatment of those having defective OB receptors. For 
example, one may treat an individual having defective OB 
35 receptors by preparation of a population of cells 
containing such non-defective OB receptor, and 
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transplanting such cells into an individual. Or, an 
individual may have an inadequate number of OB 
receptors, and cells containing such receptors may be 
transplanted in order to increase the number of OB 
5 receptors available to an individual. 

The present OB receptor proteins and related 
compositions such as OB receptor protein/OB protein 
complex, provide for weight loss, fat loss, increase in 
lean mass, increase in insulin sensitivity, increase in 
10 overall strength, increase in red blood cells (and 

oxygenation in the blood) , decrease in bone resportion 
or osteoporosis, decreased or maintained serum 
cholesterol level, decreased or maintained triglyceride 
(LDL or VLDL) levels, prevention or reduction in 
15 arterial plaque formation, treatment of hypertension, 

and prevention or reduction of gall stone formation. As 
body fat composition may be correlated with certain 
types of cancers, the present compositions may be useful 
for the prevention or amelioration of certain types of 
20 cancers. The present invention also includes methods 
for manufacture of a medicament for use in conjunction 
with the cosmetic/therapeutic conditions described 
herein, containing at least one of the present 
compositions. 

25 The present compositions and methods may be 

used in conjunction with other medicaments, such as 
those useful for the treatment of diabetes (e.g., 
insulin or analogs thereof, thiazolidinediones or other 
antihyperglycemic agents, and possibly amylin or 

30 antagonists there of) , cholesterol and blood pressure 
lowering medicaments (such as those which reduce blood 
lipid levels or other cardiovascular medicaments) , and 
activity increasing medicaments (e.g., amphetamines). 
Appetite suppressants may also be used (such as 

35 serotonin modulators and neuropeptide Y antagonists) . 
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Such administration may be simultaneous or may be In 

seriatim. 

In addition, the present methods may be used 
in conjunction with surgical procedures, such as 
5 cosmetic surgeries designed to alter the overall 
appearance of a body (e.g., liposuction or laser 
surgeries designed to reduce body mass, or implant 
surgeries designed to increase the appearance of body 
mass) . The health benefits of cardiac surgeries, such 

10 as bypass surgeries or other surgeries designed to 

relieve a deleterious condition caused by blockage of 
blood vessels by fatty deposits, such as arterial 
plaque, may be increased with concomitant use of the 
present compositions and methods. Methods to eliminate 

15 gall stones, such as ultrasonic or laser methods, may 
also be used either prior to, during or after a course 
of the present therapeutic methods. Furthermore, the 
present methods may be used as an adjunct to surgeries 
or therapies for broken bones, damaged muscle, or other 

20 therapies which would be improved by an increase in lean 
tissue mass. 

In yet another aspect, the present invention 
provides for methods of manufacture of a medicament for 
the treatment of obesity, type II diabetes, excess blood 

25 lipid, or cholesterol levels, increasing sensitivity to 
insulin, increasing lean mass, and other conditions as 
set forth above. Also provided are solely cosmetic 
treatments for individuals wishing to improve appearance 
by weight loss, and more specifically, loss of fat 

30 deposits, even in the absence of any therapeutic 
benefit. 

Diagnostic Compositions and MPthods 

As indicated supra . polypeptide products of 
35 the invention may be "labeled" by association with a 
detectable marker substance (e.g., radiolabeled with 

\ 
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125J, fluorescent, chemi luminescent, enzyme) to provide 
reagents useful in detection and quantification of OB 
receptor (or complexes) in solid tissue and fluid 
samples such as blood or urine. Nucleic acid products 
5 of the invention may also be labeled with detectable 
markers (such as radiolabels and non-isotopic labels 
such as biotin) and employed in hybridization processes 
to locate the human OB receptor gene position and/or the 
position of any related gene family in a chromosomal 

10 map. Nucleic acid sequences which selectively bind the 
human OB receptor gene are useful for this purpose. 
They may also be used for identifying human' OB receptor 
gene disorders at the DNA level and used as gene markers 
for identifying neighboring genes and their disorders. 

15 Such nucleic acid sequences may be sued for detection or 
measurement of OB receptor mRNA level from a biological 
sample. Contemplated herein are kits containing such 

labelled materials. 

The protein and/or nucleic acids provided 
20 herein may also be embodied as part of a kit or article 
of manufacture. Contemplated is an article of 
manufacture comprising a packaging material and one or 
more preparations of the presently provided 
compositions. Such packaging material will comprise a 
25 label indicating that the protein or nucleic acid 

preparation is useful for detecting and/or quantifying 
the amount of OB receptor in a biological sample, or OB 
receptor defects in a biological sample. As such, the 
kit may optionally include materials to carry out such 
30 testing, such as reagents useful for performing DNA or 
RNA hybridization analysis, or PGR analysis on blood, 
urine, or tissue samples. 

A further embodiment of the invention is 
selective binding molecules, such as monoclonal 
35 antibodies selectively binding OB receptor. The 
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hybridoma technique described originally by Kohler and 
Milstein Eur. J. Immunol. £, 511-519 (1976) has been 
widely applied to produce hybrid cell lines that secrete 
high levels of monoclonal antibodies against many 
5 specific antigens. Recombinant antibodies, ( see Huse et 
al., Science 246 : 1275 (1989)) may also be prepared. 
Such recombinant antibodies may be further modified, 
such as by modification of complementarity determining 
regions to increase or alter affinity, or "humanizing" 

10 such antibodies. Such antibodies may be incorporated 
into a kit for diagnostic purposes, for example. A 
diagnostic kit may be employed to determine the location 
and/or amount or OB receptor of an individual. 
Diagnostic kits may also be used to determine if an 

15 individual has receptors which bind OB protein, or those 
which, to varying degrees, have reduced binding capacity 
or ability. As stated infra , such antibodies may be 

prepared using immunogenic portions of an OB receptor 
protein. Such selective binding molecules may 

20 themselves be alternatives to OB protein, and may be 
formulated for pharmaceutical composition. 

Such proteins and/or nucleic acids may be used 
for tissue distribution assays (for example, as provided 
in the working example below) or for other assays to 

25 determine the location of OB receptor. 

The present OB receptor protein family may be 
used in methods to obtain OB protein analogs, mimetics 
or small molecules. One would simply prepare a desired 
OB receptor protein, particularly one with capability of 

30 binding to native OB protein, and assay the test 

molecule, which may be labelled with a detectable label 
substance, for ability to bind to such receptor. Other 
parameters, such as affinity, and location of binding, 
may also be ascertained by methods available to those 

35 skilled in the art. For example, one could use portions 
of the present OB receptors, particularly portions in 
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the extracellular domain which are necessary for ligand 
binding, to determine the location of such binding. One 
could prepare OB receptors which have various 
truncations or deletions of regions of the extracellular 
5 domain which could be used to determine the location of 
test molecule binding. One could use an OB receptor 
known to be defective in native OB binding, such as 
potentially one from an individual having such defective 
receptors, and use this as the basis for ascertaining OB 
10 protein which would be effective to result in desired 
biological activity (i.e., weight loss, reduction in 
blood dyslipidemias or lowering of cholesterol levels, 
reduction in incidence or severity of diabetes) . Other 
uses include solely cosmetic uses for alteration of body 
15 appearance, particularly the removal of fat. 

The present OB receptor protein or nucleic 
acids may also be useful to identify substances which 
"up-regulate H OB protein or receptor. For instance, the 
temporal expression of OB receptor in yixe may be useful 
20 to determine if an administered substance causes an 

increase or decrease in OB receptor. One may conclude 
that an increase in OB receptor expression results in 
modultion of weight or lipid metabolism. 

The divergence in the C-terminus may represent 
25 OB receptors with different signal transduction 

abilities. Therefore the different receptor family 
members may be used for different assays, depending on 
the type of signal transduction observed. It is thought 
that at least a portion of the intracellular domain is 
30 necessary for signal transduction (sea supra ) . 

The following examples are offered to more 
fully illustrate the invention, but are not to be 
construed as limiting the scope thereof. 
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EXAMPLE l! IDENTIFICATION OF HUMAN OB RECEPTOR PROTEIN 

Human OB receptor protein DNA was identified 
5 in a human liver cDNA library in two steps. The first 
step used two primers in polymerase chain reaction (PCR) 
to amplify a selected 300 base pair region from the 
human liver cDNA library. The second step used the PCR 
fragment as a probe to screen the human liver cDNA 

10 library. Thirteen clones were obtained, but these were 
incomplete at the 5' end. A procedure was performed to 
complete the 5* end to make complete clones. Twelve 
clones were sequenced. These twelve clones were 
identified as either "A", "B" or W C" as denoted by the 

15 C-terminus of the predicted amino acid sequence. 

Polymerase Chain Reaction. 

The original PCR primer was based on the 5* 
end and the 3' end of a 416 base pair sequence having 
20 GenBank Database Accession No. T73849. This sequence 
was selected on the basis of a known motif present in 
cytokine receptors, "WSXWS". 

The 5* primer had the sequence 73-96 of the 
416 bp sequence. The 3* primer had the sequence 337-360 
25 of the 416 bp sequence. 

These primers were used to probe a human cDNA 
liver library (Stratagene) . Standard methods were used. 

This resulted in a PCR fragment having the 
sequence 73-360 of the 416 bp fragment. 

30 

Hybridization. 

The 300 bp PCR fragment was used to probe a 
human liver cDNA library (Stratagene) using standard 
methods. This second hybridization resulted in 13 
35 positive clones. These were partial clones, incomplete 
at the 5' end. 
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Cfflffl?1 oMnn nf rhf * end - 

Rapid Amplification of cDNA End ("RACE", kit, 

GIBCO/BRL) was used to obtain the full length clones. 

5 

SfflTV pnctna rPRults - 

Sequencing revealed the three types of OB 

receptor DNAs. Of the thirteen clones, 4 clones were 
the M A" type (Seq. ID Nos. 1 and 2); 1 clone was the "B" 
10 type (Seq. ID Nos. 3 and 4) and 4 clones were of the "C" 
type (Seq. ID Nos. 5 and 6). 

As can be seen from the Sequence 
Identifications (below), OB receptor A is 896 amino 
acids long, "B" is 904 amino acids long, and "C" is 958 
15 amino acids long. These different OB receptors are 
identical at amino acid positions 1-891, and diverge 
almost completely beginning at position 892. The leader 
sequence is postulated to be, by hydrophobicity 
analysis, amino acids 1-21 (M-A) , 1-22 (M-F) or 1-28 (M-I) , 
20 with the mature protein beginning at positions 22(F), 
23 (N) or 29 (T). Based on hydrophobicity analysis, the 
leader sequence is most likely to be at positions 1-21 (M 
through A) . Chinese Hamster Ovary Cell rCHO") cell 
production of the secreted form of OB receptor protein 
25 also produced a protein having amino acid number 22 as 
the first amino acid of the mature protein. The 
transmembrane region is likely to begin at either 
position 840 (A) or 842 (L) through position 862(1), 
863 (S) or 864(H). For OB receptor type "A", the last 
30 amino acid is located at position 896 and is a lysine 
(L) . For OB receptor type "B n , the last amino acid is 
located at position 904 and is a glutamine (Q) . For OB 
receptor type "C", the last amino acid is located at 
position 958 and is glutamic acid (E) . 
35 For OB receptor protein type "C", the C- 

terminal region possesses high homology to a known human 

\ 

\ 
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transposable element. From nucleotide 2737 through 2947 
of the present human OB receptor protein type rt C", there 
is a 98.1% homology with a 211 base section of a human 
retrotransposable element described in Ono et al,, Nucl. 
5 Acids Res. 15.: 8725-8737 (1987) (bases 520 through 731, 
SINE-R11, GENBANK accession no. x07417) . 

EXAMPLE 2 : TISSUE DISTRIBUTION 

10 Tissue distribution was ascertained using two 

methods. The first method involved using the entire 
type "A** OB receptor. The second method involved using 
probes which are specific to the C-terminal region of 
the protein. Since these C terminal regions are diver- 

15 gent, the second method detected the tissue distribution 
of the different members of the OB receptor family. 

The first method used a Northern Blot kit 
(Clontech) , using the entire type A OB receptor DNA as a 
probe. The second method used PCR with primers specific 

20 to the nucleic acids encoding the divergent C terminus 
of the three types. Standard methods were used. 

Table 2 shows the results for the Northern 
Blot and the PCR methods. The indicates the 
investigator's subjective determination of the strength 

25 of signal. For the Northern Blot analysis, a triple 
"+++" indicates that a result (a dark "band" on the X- 
ray film) was seen upon overnight exposure of the film. 
A double "++" indicates that bands were seen at two 
weeks of exposure. A single w +" indicates that the 

30 bands were seen after three weeks of exposure. In 

addition, using this method, two molecular weights were 
observed, one at 4 Kb and one at 6.2 Kb. Although 
distribution was ubiquitous, the strongest signals were 
seen for ovary, heart and liver. For the PCR analysis, 

35 OB receptor "A" was seen in all tissue types tested 
(prostate, ovary, small intestine, heart, lung, liver 
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and skeletal muscle), type "B" was seen only in lung and 
liver, and type "C H was seen in ovary, heart, lung and 
liver. 

5 Table 2 

T ^«mft ni»fc r <i™*lon Novttl OB Receptor 




Testis 



Nnrfhprn Blot 



PCR 



Ovary 



+++ 



Small 
Intestine 



++ 



Colon 



10 



15 



20 



Peripheral 
blood 
Leukocyte 




fi y&MT>T.R 3 ; IDENTIFICATION OF HUMAN OB RECEPTOR GENOMIC 
DNA AND CHROMOSOME LOCALIZATION; IDENTIFICATION OF HUMAN 
OB RECEPTOR "D w 

The full length human OB receptor genomic DNA 
was also prepared. OB receptor **A" cDNA, in its 
entirety, was used as a probe against a human genomic 
DNA library, using materials and methods from a 
commercially available kit (Genome Systems, using a 
human genomic library in a PI vector) . A single 
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positive clone was detected. There are introns located 
at (with respect to OB receptor M A" DNA) base pair 
number: 559, 1059, 1350, 1667, 1817, 1937, 2060, 2277, 
2460, 2662, and 2738. 
5 The human OB receptor gene was localized to 

human chromosome 1P31 by FISH analysis (Genome Systems) . 
Human chromosome 1 is thought to correspond to mouse 
chromosome 4C7, which is presumed to be the location of 
the do locus. 

10 A further chromosomal sequence was isolated. 

This chromosomal DNA sequence was isolated from a human 
genomic library as described above. This chromosomal 
sequence encodes what is here denominated human OB 
receptor "D w , and the encoded amino acid sequence is set 

15 forth in SEQ. ID No. 7. A cDNA encoding this amino acid 
sequence is set forth in SEQ. ID No. 8. The chromosomal 
DNA intron/exon junction map is set forth as SEQ. ID 
No. 9. 

As with forms "A", "B", and "C", for the 
20 present form "D" OB receptor protein, the first amino 
acid of the mature protein is likely (using 
hydrophobicity analysis) to begin at position 22 (F) , 23 
(N) or 29 (T) . The last amino acid of the protein is at 
position 1165 and is a valine residue. As with the 
25 other forms, the extracellular domain extends from 

position 22 (F) , 23 (N) or 29 (T) to position 839 (D) or 
841 (G) . The transmembrane domain appears to begin at 
position 840 (A) or 842 (L) . The end of the 
transmembrane domain appears to be located at position 
30 862 (I), 863 (S) or 864 (H) . The C-terminal region, 
beyond the transmembrane region, is likely to be 
involved in signal transduction, and is located at 
position 863 (S) , 864 (H) or 865 (Q) through position 
1165 <V). 

35 The present OB receptor form "D" is identical 

to that published by Tartaglia et al, Cell j£: 1263-1271 
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(December 29, 1995) with the exception of a single amino 
acid change at amino acid position 976 (nucleotide codon 
begining at position 3022) . The present type M D" amino 
acid at position 97 6 is aspartic acid, and the published 
5 amino acid corresponding to the same position is 

alanine. This is a non- conservative substitution, £££ 
Infra , and since the location of the substitution is 
within a region thought important for signal trans- 
duction, this change could affect the function of the 
10 molecule. 

pVAMPLE 4 : PREPARATION OF SOLUBLE OB RECEPTOR 

Three forms of soluble human OB receptor have 
15 been prepared: 

1. Trader + ExtraceJ i »1 «r remain (Seq. 
ID Nos. 10 and 11): A recombinant form of the soluble 
human OB receptor was prepared. This form encompasses, 
in the immature protein, the leader sequence and the 

20 extracellular domain (amino acids 1-839) . The mature 
protein would have the leader sequence deleted, and the 
first amino acid of the mature recombinant soluble human 
OB receptor would be 22 (F) , 23 (N> or 29 (T) • This 
protein was expressed as described below. 

25 2. T^ariPr + ^irarpUular Domain + C- 

t f m»<nai flag (Seq. ID No. 12) : A second form of the 
recombinant soluble human OB receptor was also prepared. 
This form had a "FLAG" tag located at the "C- terminus 
of the protein. The "FLAG" peptide is a useful research 

30 tool as it allows one to follow the protein using an 
antibody which recognizes the "FLAG" peptide. Such 
reagents are commercially available (IBI, New Haven, 
CT) . This protein was expressed as described below. 

3. Native Sp Ufl* Variant (Seq. ID Nos. 

35 13 and 14) : This form is believed to the the 

recombinant form of a naturally occurring secreted, 
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soluble human OB receptor. This form has most of the 
amino acids found in the extracellular domain (amino 
acids 22-798) , and a unique 6 amino acid sequence at the 
carboxyl terminus. Beginning at amino acid position 799 
5 of Seq. ID No. 13, the amino acid sequence of this 

native splice variant human OB receptor protein is "G K 
F T I L. M 



EXAMPLE 5 : PREPARATION OF EXPRESSION VECTORS 

10 

Recombinant human OB receptor expression 
vectors have been prepared for expression in mammalian 
cells. As indicated above, expression may also be in 
non-mammalian cells, such as bacterial cells. The type 

15 "A" cDNA (Seq. ID No. 2) was placed into a commercially 
available mammalian vector (pCEP4, Invitrogen) for 
expression in mammalian cells, including the commercial- 
ly available human embryonic kidney cell line, "293". 

Recombinant human OB receptor expression 

20 vectors have been prepared for expression of recombinant 
soluble OB receptor, consisting of the leader sequence 
and the extracellular domain (Seq. ID Nos. 10 and 11}, 
using the same system as above (the commercialy 
available mammalian vector pCEP4, and "293" cells) . 

25 This recombinant soluble human OB receptor was also 
expressed in CHO cells in a similar way. 

The "FLAG-tagged" form (Seq. ID No. 12) of the 
recombinant soluble human OB receptor, and the M D" form 
(Seq. ID No. 7) were also expressed in "293" cells in a 

30 similar fashion as above. 

Detection of desired protein was accomplished 
using BIACORE (Pharmacia) analysis. This analysis is 
analogous to that described in Bart ley et al., Nature 
i£i: 558-560 (1994) . 

35 Essentially, the BIACORE machine measures 

affinity interactions between two proteins. In this 
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case, the OB protein was immobilized on the machine, and 
conditioned media from cell lines expressing the OB 
receptor was added to the machine. Any receptor protein 
present in the conditioned media bound to the OB protein 
5 surface. The BIACORE machine gave a read-out indicating 
that receptor protein was being expressed. For 
recombinant soluble receptor (Seq. ID No. 10) expression 
in "293" cells, the read-out was 191.0 relative to a 
baseline readout of 0. For recombinant soluble receptor 
10 (SEq. ID No. 10) expression in CHO cells, the read-out 
was 150.9 relative to a baseline readout of 0. For 
recombinant soluble receptor with a C-terminal FLAG- tag 
(Seq. ID. No. 12), the read-out was 172.0 relative to a 
baseline of 0. 

15 For expression in bacterial cells, one would 

typically eliminate that portion encoding the leader 
sequence (e.g., potentially amino acids 1-21, 1-22 or 1- 
28) - One may add an additional methionyl at the 
N-terminus for bacterial expression. Additionally, one 

20 may substitute the native leader sequence with a 
different leader sequence, or other sequence for 
cleavage for ease of expression. 

EXAMPLE 6 : DEMONSTRATION OF SIGNAL TRANSDUCTION 

25 

This example demonstrates that the "D" form is 
active to produce a signal within a cell, whereas in the 
same cell type, the tt A M form does not. The signal 
transduction assay was performed by the use of "293" 

30 cells transiently expressing either the "A" or the "D" 
form (see above for preparation of the "293 n expression 
clones) . Phosphorylation of molecules predicted to be 
involved in signal transduction within the cell was 
examined upon OB protein binding to the OB receptor 

35 protein tested. The results demonstrate that upon 

binding of OB protein to the extracellular domain, the 

\ 

\ 

\ 

\ 
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"D" form of the present OB protein receptor transduces a 
signal sufficient to initiate phosphorylation of 
signalling molecules. 

5 Methods 

1. OB receptor molecules. As indicated 
above, the n A" form {Seq. ID No. 1) and the "D" form 
(Seq. ID. No. 7) were studied. 

2. Expression system. The pCEP 4 system (as 
10 described above) having inserted DNA encoding the "A" 

form (Seq. ID No. 2) or the tt D" form (Seq. ID No. 8) was 
used to transfect "293" cells. These cells did not 
allow for the pCEP4 vector to integrate into the genome, 
so such expression was transient. Non-recombinant 
15 (mock-transfected) cells were also prepared as controls. 

3. Detection Qf phosphorylation- Mock 

transfected cells and cells expressing the "A" form or 
the "D M form were analyzed. Prior to treatment the 
cells were serum- starved by incubation in media with 

20 0.5% serum for 16 hours prior to the treatments. The 

cells were treated with the OB protein (10 mg/ml) for 15 
minutes at 37°C # after which the cells were lysed in 
modified NP40 buffer (50 mM Tris, pH 8.0, 150 mM sodium 
chloride, 1% NP40, 10 mg/ml aprotinin, 5mM EDTA, 200 mM 

25 sodium orthovanadate) . Phosphotyrosine containing 

proteins were iramunoprecipitated (Anti-phosphotyrosine 
antibody 4G10, UBI, Lake Placid, NY), and separated by 
SDS polyacrylamide gel electrophoresis. After 
electrophoresis and electroblotting to membranes the 

30 immunoprecipitates were probed with antibodies to 

various signal transduction molecules. Antibodies to 
STATs, JAKs and EKKs were purchased from Santa Cruz 
Biotechnology Inc. Immune complexes were detected by 
horseradish peroxidase conjugated secondary reagents 

35 using chemi luminescence as described by the manufacturer 
(ECL, Amersham) . As a positive control, 32D cells were 
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treated with IL-3, which is known to activate by 
tyrosine phosphorylation most of the molecules being 
analyzed. 

4. Results . Results are presented in Table 
5 3, below. As can be seen, only the "D" form was able to 
respond to either mouse or human OB protein as detected 
by phosphorylation of JAK and STAT molecules. A "+" 
designation indicates signal was detected, a "- n 
designation means that no signal was observed. 

10 

TABLE 3 



Signal 
/AB* 


293 
Alone 


293/D 
hrOB* 


293/D 
mrOB** 


293/A 
hrOB# 


293/A 
mrOB## 


32D 
IL-3 


STAT1 




+ 










STAT 3 




+ 


+ 






+ 


STATS 




+ 


+ 






+ 


JAK1 




+ 


+ 






+ 


JAK 2 




+ 


+ 






+ 


JAK 3 














TYK2 






+ 








ERKS 
1.2 












+ 

i 



* Antibody detection target 

* 293 cells expressing receptor form H D", treated with 
15 recombinant human OB 

** 293 cells expressing receptor form W D" treated with 

recombinant murine OB 

* 293 cells expressing receptor form "A" treated with 
recombinant human OB 

20 ## 293 cells expressing receptor form "A" treated with 
recombinant murine OB 

The "D" form is capable of initiating 
signalling through the JAK/STAT pathways in 293 cells, 
whereas the "A" form cannot. 
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EXAMPLE 7 ; USE OF SOLUBLE OB RECEPTOR AS A THERAPEUTIC 

This example demonstrates that soluble OB 
5 receptor protein acts to protect the activity of OB 
protein. Below, soluble OB receptor and/or OB protein 
was delivered to a mammal via "gene transplant" — that 
is, via bone marrow cells engineered to express the 
desired ONAs. When soluble OB receptor combined with OB 

10 protein was delivered, the animals lost more weight than 
delivery of OB protein alone. This demonstrates the 
protective activity of OB receptor protein. 

While not wishing to be bound by theory, one 
explanation of the mode of action is that soluble OB 

15 receptor protein acts to protect the OB protein in serum 
from agents or conditions which could diminish its 
activity. The protective action appears to increase 
circulating half-life of the protein. As such, the 
present example demonstrates that OB receptor either 

20 alone, or administered as a complex with OB protein (or 
analog or derivative thereof) could act as a therapeutic 
agent . 

Materials and methods: 
25 1. Preparation of reromhi nant nh retroviral 

vector Packaging Cells. 

Use of murine ob cDNA. Full length wild-type 

murine ob cDNA was amplified by the PCR using synthetic 

oligonucleotides designed from the published sequence 
30 Zhang et al., Nature 372; 425-432 (1994) .Linkers (An Eca 

RI linker and a Bal II linker) were used to facilitate 

subcloning. 

Use of soluble recombinant human OB r eceptor 
£QH&. Methods similar to those above were used. A 
35 construct containing the recombinant human soluble 

receptor of Seq. ID No. 10 was used, and modified with 
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linkers to facilitate cloning (i.e., the addition of a 
Bal II restriction endonuclease recognition site) . 

pi^m^nr n f h,»<H r*d rnNA into vector. PCR 
products were digested with ScoRI and Bglll and cloned 
5 into similarly-digested parental vector (pMSCV2.1) under 
the transcriptional control of the viral LTR promoter. 
The parental MSCV vector (supplied by R. Hawley, 
University of Toronto, Canada) was derived from MESV 
(murine embryonic stem cell virus) and contains a 
10 neomycin phosphotransferase resistance (neo r ) gene 
driven by an internal mouse phosphoglycerate kinase 
(PGK) promoter, as described* Hawley, et al, J. Exp. 
Med. I2£: 1149 -1163 (1992) . The parental plasmid 
pMSCV2.1 and pMSCV-OB were independently elect ropo rated 
15 into the GP+E-86 packaging cell line (supplied by Dr. A. 
Bank, Columbia University, NY) Markowitz et al., J. 
Virol. £2:1120-1124 (1988). Transient supernatants were 
harvested from electroporated populations and used to 
infect tunicamycin treated parental GP+E-86 cells. 
20 Tunicamycin treatment relieves the block to 

superinfection of the parental packaging cells. G418 
(0.78 mg/mL, 67% active, GIBCO Laboratories, Life 
Technologies, Inc., Grand Island, NY) resistant clones 
were selected from each infected population and titered 
25 by infection of NIH3T3 cells. Clones with the highest 
G418 resistant titer were expanded and frozen as 
aliquots. Each bone marrow infection and 
transplantation experiment used aliquots from the same 
passage of frozen viral packaging cells. Both the 
30 parental and oJb packaging cell lines were tested for the 
presence of, and found to be free from, replication 
competent virus using a sensitive marker rescue assay. 
Moore, et al., (1993) in: Gene Targeting: A Practical 
Approach, Joyner, Ed. (Oxford University Press, New 
35 York, NY) ♦ 
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2. PrftrinrMnn of RPl-rnviral Supernatant s . 
Recombinant virus -producing packaging cell lines were 
grown in 175cm 2 tissue culture flasks in Iscove's 
Modified Dulbecco's Medium (IMDM) (GIBCO) # 10% (v/v) 

5 FBS, at 37°C. Sub-confluent (approximately 60%) 

monolayers of cells were fed with fresh medium 24h prior 
to harvest of virus-containing supernatants . Viral 
supernatants were removed from packaging cell lines by 
aspiration, sterile filtered (0.45mM) and added directly 
10 to bone marrow cultures. Fresh aliquots of frozen 
packaging cell lines were thawed for use in each 
experiment . 

3. Bone Marrow Infection and Transpla ntation. 
Eight to 12-week old female C57BL/6J (+/+) or (ob/ob) 

15 mice were used as bone marrow donors and recipients. 

All mice were purchased from The Jackson Laboratory (Bar 
Harbor, ME J and housed under specific pathogen- free 
conditions in a vivarium in accordance with governmental 
regulations and institutional guidelines. 
20 Bone marrow cells were harvested from femurs 

and tibias of donor mice 4 days post 5-f luorouracil (5- 
FU, Sigma Chemical Co., St. Louis, MO) treatment (150 
mg/kg i.v. ). Bone marrow cells (6 X 10^/mL) were 
incubated in 150mm tissue culture dishes (30mL/dish) 
25 containing fresh viral supernatant (as described above), 
15% FBS, 6 mg/mL polybrene (Sigma), 0.1% bovine serum 
albumin (BSA, Fraction V, Sigma), 2.5 ng/mL recombinant 
mouse IL-3 (rmIL-3), 100 ng/mL each of recombinant human 
IL-6 (rhIL-6) , recombinant human IL-11 (rhIL-11) , and 
30 recombinant rat SCF (rrSCF) . All growth factors were 
produced by Amgen, Inc. (Thousand Oaks, CA) . Culture 
media were replaced daily for 3 days with fresh virus- 
containing supernatant and growth factors. 

At the end of the infection period, total non- 
35 adherent and adherent cells were washed and resuspended 
in 1% BSA-saline and transplanted into g- irradiated (12 
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Gy, Cs 137 ) mice. Each animal was transplanted with 2.5 
X 10 6 syngeneic cells. There were approximately 10 

animals per cohort. 

4. analysis of Oft protpin expression in 

5 1-ransfect-gri ra>ils anri transnl antffri animals. For 

transfected bone marrow cells, Western analysis was 
performed. Vector packaging cell supernatant was 
resolved by SDS-PAGE (16% acrylamide) , then transferred 
to Hybond-ECL (Amersham, Arlington Heights, IL) . The 
10 filter was incubated with affinity-purified rabbit a- 
mouse OB protein polyclonal antibody (lmg/mL) in T-TBS 
buffer (20mM Tris -chloride, pH7.6, 13*7mM NaCl, 0.1% 
Tween20) at room temperature for 45 rain. Horseradish 
peroxidase (HRP) -conjugated donkey a-rabbit IgG 
15 (Amersham) was diluted in T-TBS (1:2500) and incubated 
with the filter at room temperature for 45 min. 
Enhanced chemiluminescence (ECL, Amersham) detection was 
performed as recommended by the manufacturer. 

For transplanted animals, serum was analyzed. 
20 Animals were bled retroorbitally, under isofluorane 

anesthesia. Serum from transplanted oh/oh animals was 
resolved by SDS-PAGE (4-20% acrylamide) under non- 
reducing and reducing conditions, then transferred to 
Trans-Blot (Bio-Rad Laboratories, Hercules, CA) 
25 membranes. The membranes were incubated for 2 hours at 
room temperature with HRP- conjugated rabbit a-mouse OB 
protein antibody <0.125mg/mL) in T-TBS buffer containing 
5% fetal bovine serum and 1% bovine serum albumin. 
Bound OB protein was detected by ECL (Amersham) , 
30 performed as recommended by the manufacturer. 

For quantitation of soluble OB protein levels, 
serum from transplanted animals was subjected to ELISA 
analysis. Briefly, affinity-purified rabbit a-OB 
protein polyclonal antibody was coated onto 96-well 
35 plates. Standards (purified recombinant OB protein 

\ 
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Gy, Cs 137 ) mice. Each animal was transplanted with 2.5 
X 10 6 syngeneic cells. There were approximately 10 

animals per cohort. 

4. Analysis rm nrnt.Mn expression in 

5 transfected cells ar>rt transnl anted animals. For 
trans fected bone marrow cells, Western analysis was 
performed. Vector packaging cell supernatant was 
resolved by SDS-PAGE (16% acrylamide) , then transferred 
to Hybond-ECL (Amersham, Arlington Heights, IL) . The 
10 filter was incubated with affinity-purified rabbit a- 
mouse OB protein polyclonal antibody (lmg/mL) in T-TBS 
buffer (20mM Tris-chloride, pH7.6, 137mM NaCl, 0.1% 
Tween20) at room temperature for 45 min. Horseradish 
peroxidase (HRP) -conjugated donkey a-rabbit IgG 
15 (Amersham) was diluted in T-TBS (1:2500) and incubated 
with the filter at room temperature for 45 min. 
Enhanced chemi luminescence <ECL, Amersham) detection was 
performed as recommended by the manufacturer. 

For transplanted animals, serum was analyzed. 
20 Animals were bled retroorbitally, under isofluorane 

anesthesia. Serum from transplanted ob/ob animals was 
resolved by SDS-PAGE (4-20% acrylamide) under non- 
reducing and reducing conditions, then transferred to 
Trans-Blot (Bio-Rad Laboratories, Hercules, CA) 
25 membranes. The membranes were incubated for 2 hours at 
room temperature with HRP-conjugated rabbit a-mouse OB 
protein antibody (0.125mg/mL) in T-TBS buffer containing 
5% fetal bovine serum and 1% bovine serum albumin. 
Bound OB protein was detected by ECL (Amersham) , 
30 performed as recommended by the manufacturer. 

For quantitation of soluble OB protein levels, 
serum from transplanted animals was subjected to ELISA 
analysis. Briefly, affinity-purified rabbit a-OB 
protein polyclonal antibody was coated onto 96-well 
35 plates. Standards (purified recombinant OB protein 
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monomer, Pelleymounter et al., Science 2£1' 540-543 
(1995) and experimental samples were added, and the 
plates were incubated at room temperature. The plates 
were washed twice and affinity-purified rabbit a-OB 
5 protein antibody conjugated to horseradish peroxidase 
was added. Following incubation at room temperature, 
the plates were washed four times with TNE-Tween20. 
TMB/peroxide substrate was added and the color reaction 
was read at 450nm in a Molecular Devices plate reader. 
10 OB protein concentrations in sera were estimated by 
comparison to a standard curve prepared from internal 
standards. OB protein levels were reliably measured in 
samples containing >160 pg/mL. 

5. Body Weight and Food Intake. Mice were 

15 offered pelletized rodent chow (PMI Feeds, Inc., St. 
Louis, MO) ad libitum . The body weight of individual 
animals was measured daily for the first two months of 
analysis, and weekly thereafter. Food consumption was 
measured daily on selected groups of individually-housed 

20 animals. 

Results 

Results are presented in Tables A and 5 below. 
Administration of OB protein receptor increased the 

25 effectiveness of OB protein. This may have been 

accomplished via an increased circulation time of OB 
protein in the presence of OB protein receptor. 

As can be seen in the Table, animals 
administered a combination of OB protein and OB protein 

30 receptor (via genetic therapy) had a greater weight loss 
after 28 days than either composition alone. The Table 

presents the results of two experiments (" / ") . As 

can be seen, use of the OB protein alone at day 40 
resulted in animals with 87.5% and 72.2% of the starting 

35 weight. Using OB receptor in combination with OB 
protein, however, resulted in animals with 66% and 
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53.6% of the starting weight. Use of the receptor alone 
appeared to have little effect, if any. 



TABLE 4 



Treatment 


Weight (g) 
decrease at 
day 28 (ave) 


% starting 
weight 
(ave) 
dav 2B 


% starting 
weight 
(ave) 
dav 40 


OB alone* 


6.3/12.7 


87.9/75.3 


87.5/72.2 


Receptor** 
alone 


tl.4]/[0.3] 


103/100.6 


104.2/101.7 


OB + 
Receptor*** 


12.6/16.8 


76.3/67.5 


68/53.6 



* 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
without genetic alteration 

** 50% bone marrow cells transfected with OB receptor 
10 protein cDNA as described above, and 50% bone marrow 
cells without genetic alteration 

*** 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
transfected with OB receptor protein cDNA as described 
15 above. 

Table 5, below, contains results of the OB 
levels found in the serum from animals administered OB 
protein alone, or administered OB protein in combination 
with OB protein receptor (via the "gene therapy" method 
20 of this example) . The data reflect nanograms of OB 
protein per milliliter of serum, plus or minus the 
standard error of the mean. 



\ 

\ 
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TABLE 5 



Treatment 


Experiment #1* 


Experiment #2tt 


OB alone* 


2.93 +/- 0.77 


9.74 +/- 1.02 


Receptor* * 
alone 


0.08 +/- 0.05 


0.12 +/- 0.07 


OB + 
Receptor*** 


12.11 +/- 1.90 


15.18 +/- 2.52 



* 50% bone marrow cells trans fected with OB protein 
5 cDNA as described above, and 50% bone marrow cells 

without genetic alteration 

** 50% bone marrow cells transfected with OB receptor 
protein cDNA as described above, and 50% bone marrow 
cells without genetic alteration 
10 *** 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
transfected with OB receptor protein cDNA as described 
above . 

* Experiment #1 was conducted as described above, 
15 with OB protein serum levels measured after 38 days. 

it Experiment #2 was also conducted as described 
above, with OB protein serum levels measured after 24 
days. 

The data demonstrate the protective effects of 
20 OB receptor. As can be seen, in the presence of OB 
receptor, OB protein has a higher accumulation in the 
serum. The degree of accumulation is observed to 
increase inversely with the levels of OB protein in the 
serum. In Experiment #1 (with a base OB protein level 
25 of about 2.93 ng/ml) , the OB protein serum level 

increased about 400% with the addition of receptor, 
where in Experiment 12 (with a base of about 9.74), the 
OB protein serum level increased by about 25%. 

OB receptor administered either alone or in 
30 association with OB protein (or analogs or derivatives 
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thereof) may serve to increase the circulation time of 
OB protein, and therefore enhance the therapeutic 
efficacy of either exogenous or endogenous OB protein. 

5 F.VhMPLE B i PREPARATION OF SELECTIVE BINDING MOLECULES 

Animals were immunized for the preparation of 
polyclonal antibodies using the following peptides (with 
respect to the numbering of the amino acids for OB 
receptor A, Seq. ID No. 1): 54-64; 91-100; 310-325; 
10 397-406; 482-496; B74-885; and, with respect to amino 

acids of OB receptor "C w (Seq. ID No. 5), 910-929. Some 
of the polyclonal antibodies prepared (in rabbits) were 
tested for ability to bind to recombinant human OB 
receptor protein. The polyclonal antibody prepared 
15 against amino acids 54-64 was found to have the highest 
affinity for recombinant human OB receptor protein. The 
polyclonal antibody prepared against amino acids 397-406 
was also found to bind to recombinant human OB receptor 
protein. The polyclonal antibody prepared against amino 
20 acids 91-100 was found to slightly bind to recombinant 
human OB receptor protein. The polyclonal antibody 
prepared against amino acids B74-885 was found not to 
bind to recombinant human OB receptor protein. 

An additional study was performed which 
25 demonstrates the expression and purification of the 

extracellular domain of the OB receptor protein in CHO 
cells, and antibodies which recognize this OB protein 
receptor extracellular domain. 

The extracellular domain of the human OB 
30 receptor protein was expressed as a secreted, soluble 
protein in CHO cells as previously described supra . 
Individual cell lines were isolated and grown in 
increasing amounts of methotrexate to increase 
selection/expression of the recombinant receptor protein 
35 (100, 200 or 500 micrograms methotrexate per ml of 

media) . Conditioned media from the CHO cell lines was 
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collected, and the proteins in the conditioned media 
were fractionated by SDS-PAGE. The OB receptor 
extracellular domain migrated as a broad band with an 
apparent size range of about 140 kDa to about 200 kDa. 
5 The OB receptor protein extracellular domain was 
detected by Western Blot analysis using polyclonal 
antibodies prepared against a portion of the extracel- 
lular domain of the OB receptor protein. The unfolded, 
bacterially expressed protein was used as an antigen to 

10 generate antisera in rabbits. The identified OB receptor 
extracellular domain was purified by affinity 
chromatography. The purified protein was sequenced at 
the amino terminus to confirm that it was the OB 
receptor and also to determine the start of the mature 

15 protein (after signal peptide cleavage) as expresed in 
CHO cells, it was found that amino acid no. 22 
(according to the amino acid sequence numbering of Seq. 
ID No. 1, infra ) . was the first amino acid of the mature 
protein as expressed in CHO cells. 

20 Other immunogenic peptides may be used. 

Polyclonal, monospecific polyclonal, monoclonal, 
antibody fragments, and recombinant antibodies may be 
prepared using methods available to those skilled in the 
art. 

25 One may further use recombinant techniques or 

peptide synthesis methods to alter the character of such 
selective binding molecules. This may be accomplished 
by preparing recombinant antibodies having altered 
complementarity determining regions (sometimes referred 

30 to in the art as "CDR's") to, for example "humanize" the 
antibodies by using human F c (constant) regions. Other 

types of recombinant antibodies, for example, those 
having CDR's altered to enhance affinity or selectivity 
to one or more members of the OB receptor family, may be 
35 prepared and used using methods available to those 
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skilled in the art. £££ Winter et al., Nature 212: 293- 
299 (1991). 

The present 06 receptor protein may be used as 
an assay to screen for desired selective binding 
5 molecules. Such assay may be based on binding 

capability, or biological activity, or, other means of 
detecting signal transduction. For example, if one were 
to prepare a series of modified antibodies, one could 
test them for affinity (i.e, binding strength) against 

10 the target OB receptor. 

The selective binding molecules may be useful 
for diagnostic purposes, such as tissue distribution 
analysis, or to diagnose the relative affinity of an 
individual's OB receptors for such selective binding 

15 molecule to determine the functionality of an 

individual's OB receptor during a course of therapy. 
Selective binding molecules may be alternative 
therapeutic or cosmetic products to OB protein. 

20 F.YRMPTiF. 9; CT!MB TRF.RAPY 

One may deliver the present OB receptor 
protein via gene therapy, as described i nfra * 

One may envision, using materials and methods 
available to those skilled in the art and provided 

25 herein, using T-cells as an agent carrying DNA 

expressing OB receptor for gene therapy. An individual 
would have T-cells selected using CD34+ selection and a 
magnetic microparticles selection device. Such cells 
would be transfected with the desired DNA, or the 

30 regulation of the desired coding region may be altered 
using homologous recombination or other in s i tu 
techniques. The transduced cells could be selected 
empirically, using means to detect the desired protein, 
or a marker may be included which permits indirect 

35 detection (i.e., a selectable marker as is known in the 
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art) . Optionally, such cells could be expanded, for 
example, using one or more growth factors such as SCF or 
an interleukin, and such cells could be stored for 
future use. In such a way, the procedure would only 
have to be accomplished once or infrequently in an 
individual's lifetime, for later transfer into the 
individual. The cells would be re-planted into the 
individual, and the individual would be monitored for 
desired therapeutic effect, such as weight 
loss/maintenance of weight, diabetes recurrence, blood 
lipid levels, or other conditions. 

Illustrative Nucleic Acid and Amino Acid Sequences 

The below amino acid and DNA sequences are 
15 those to which reference has been made. An aster ick. {"*") 
indicates the position of a stop codon. 



5 



10 
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finnan OB P^pfnr " ft " amino Arid SftCMftnce fSfiq. TP NO . 1 lAmiPQ 





Acid. 


«*nolA latter abbreviation) ; 






1 


MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP 


NSTYDYFLLP 


5 


51 


AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC 


FRSEQDRNCS 




101 


LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF 


ICYVESLFKK 


10 


151 


LFKNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 




201 


PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP 


LGLHMEITDD 


15 


251 


GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA 


TSLLVDSILP 


301 


GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI 


LTSVGSNVSF 




351 


HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK 


VTFFNLNETK 


20 


401 


PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG 


YLTKMTCRWS 




451 


TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL 


QSDGFYECIF 


25 


501 


QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP 


SSVKAEITIN 


551 


IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD 


AKSKSVSLPV 




601 


PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM 


RGPEFWRIIN 


30 


651 


GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT 


WSEDVGNHTK 




701 


FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI 


VQSLSAYPLN 


35 


751 


SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS 


SVKKYYIHDH 


801 


FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 




851 


SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ 


KRTDIL*SLI 


40 


901 


MITTDEPNVP TSQQSIEY*K IFTF*RRGAN IKKIQLNF*E 


LTYGGLC*FR 




951 


T*NRCVNLGS KCRFESSLDV *L 





\ 

\ 



\ 

\ 
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Human OB Receptor "A" DNA Se quence (Sea. ID No. 2 JDNAt ) : 



1 


CCGCCGCCAT 


CTCTGCCTTC 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


51 


CTTCTCTGAA 


GTAAGATGAT 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


251 


ACATTATGAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


301 


TTTCTAACTT 


ATCCAAAACA 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


501 


TCATTATTTA 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGTTGTGAA 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


801 


GAAATCACAG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


851 


GGTACCATTT 


CCACTTCAAT 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


901 


CAGTTATCAG 


AGAAGCTGAC 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


951 


GACAGTATAC 


TTCCTGGGTC 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


1UU1 


ACTGGATGGC 


CCAGGAATCT 


GGAGTGACTG 


GAG TACTCCT 


CGTGTCTTTA 


1051 


CCACACAAGA 


TGTCATATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


1251 


CTGAATGAAA 


CCAAACCTCG 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 
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1301 CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 
1351 ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG 
1401 ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT 
1451 GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
1501 TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT 
1551 TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG 
1601 GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 
1651 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
1701 GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT 
1751 CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
1801 AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT 
1851 GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
1901 CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 
1951 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
2001 TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT 
2051 ACTTTGGAAG CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
2101 ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG 
2151 GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT 
2201 TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 
2251 TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
2301 GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC 
2351 CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 
2401 AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
2451 TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
2501 CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 
2551 CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT 



/ 
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2601 
2651 
5 2701 
2751 
2601 

10 

2851 
2901 
15 2951 
3001 
3051 

20 

3101 
3151 



GTGCCAGTAA 
ATCACACCAA 
AGAATTGTTC 
CTTTGAAGTC 
CCAACAGTCT 
CAAATCTAAA 
TTATGTTGAT 
TAGATTTGAG 
TTTAAAAGTA 
TCATAAAACA 
AAATGTCATC 
AACTGCAACA 



TTATTTCCTC 
AGAATGAAAA 
CTGGGCACAA 
TAATCATGAT 
ATAGAGTATT 
AAAAATTCAG 
TTAGAACTTA 
TCCAGTTTGG 
GTATTCATGA 
TTAAGAAAAT 
AAATATGTAG 
TCTGACAAAT 
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TTCCATCTTA 
AGCTATTTTG 
GGACTTAATT 
CACTACAGAT 
AGAAGATTTT 
TTGAACTTCT 
AAATAGATGT 
ATGTGTGATT 
TTTCTGGCTT 
TATGGCTGTT 
TAGACAATTT 
TGCTTTAAAA 



TTGCTTGGAA 
GGAAGATGTT 
TTCAGAAGAG 
GAACCCAATG 
TACATTTTGA 
GAGAGT7AAC 
GTAAATTTGG 
AATTTTCAAA 
TTGATTTGCC 
GCTGTCATTA 
TGTAATTAGG 
ATACAATGAT 



CATTATTAAT 
CCGAACCCCA 
AACGGACATT 
TGCCAACTTC 
AGAAGGGGAG 
ATATGGTGGA 
GTTCAAAATG 
TCATCTAAAG 
ATATTCCTGG 
CATATCTATT 
TGAACTCTAA 
TAT 
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Huma n Recepto r w r" Amino AHri secmpincft (Seq. TP No , 3 (Am i no 

5 1 MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 

51 AGLSKNTSNS IJGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS 

101 LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF ICYVESLFKN 

10 151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 

* 

201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 

15 251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 

301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 

351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 

20 401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 

451 T5TIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 

25 501 QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 

551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 

601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRIIN 

651 GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 

701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 

35 751 SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS SVKKYYIHDH 

801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 
851 SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ KKRLSIFLSS 
40 901 IQHQ*HWLF FWSLKQFQKI SVLIHHGKIK MR*CQQLWSL YFQQQILKRV 
951 LFVLVTSSTV LTSLRLRVLR *PMRTKARDN PLLNTPR*SA TLNQVKLVK 

45 
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Human OB Rproptnr "B" DNA fipqnpnre (Sea. ID No . 4 fDNAl ) : 



1 


CCGCCGCCAT 


CTCTGCCTTC 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


51 


CTTCTCTGAA 


GTAAGATGAT 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


251 


ACATTATGAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


301 


TTTCTAACTT 


ATCCAAAACA 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


501 


TCATTATTTA 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGTTGTGAA 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


801 


GAAATCACAG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


851 


GGTACCATTT 


CCACTTCAAT 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


901 


CAGTTATCAG 


AGAAGCTGAC 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


951 


GACAGTATAC 


TTCCTGGGTC 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


1 001 




PPAfiflAATPT 


fifiAfiTfSAPTfi 


RAfiTAPTPPT 




1051 


CCACACAAGA 


TGTCATATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


1251 


CTGAATGAAA 


CCAAACCTCG 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 
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130 X CTGCAATGAA CATGAATGCC 

1351 ATGTCAATAT CAATATCTCA 

5 

1401 ACTTGCAGAT GGTCAACCAG 

1451 GCAATTGAGG TATCATAGGA 

10 1501 TTCATCCCAT ATCTGAGCCC 

1551 TATGAATGCA TTTTCCAGCC 

1601 GATTAGGATC AATCACTCTC 

15 

1651 TCCTTCCTGA TTCTGTGGTG 

1701 GAAATTACTA TAAACATTGG 

20 1751 CTTTCCAGAG AATAACCTTC 

1801 AAGAAGTACA ATGGAAGATG 

1851 GTCAGTCTCC CAGTTCCAGA 

25 

1901 CTGTAAGAGG CTAGATGGAC 

1951 CCTACACAGT TGTCATGGAT 

30 2001 TGGAGAATAA TTAATGGAGA 

2051 ACTTTGGAAG CCCCTGATGA 

2101 ATGTGATAAA CCATCATACT 

35 

2151 GGAAATCACA CGAAATTCAC 

2201 TACGGTTCTG GCCATCAATT 

40 2251 TAACCTTTTC ATGGCCTATG 

2301 GCTTATCCTT TAAACAGCAG 

2351 CAGTGATTAC AAGCTAATGT 

45 

2401 AAGATGGTGA AATAAAATGG 

2451 TATATCCATG ATCATTTTAT 

50 2501 CCCAATATTT ATGGAAGGAG 

2551 CTCAAGATGA TATTGAAAAA 



70 - 

ATCATCGCTA TGCTGAATTA TATGTGATTG 
TGTGAAACTG ATGGGTACTT AACTAAAATG 
TACAATCCAG TCACTTGCGG AAAGCACTTT 
GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
AAAGATTGCT ATTTGCAGAG TGATGGTTTT 
AATCTTCCTA TTATCTGGCT ACACAATGTG 
TAGGTTCACT TGACTCTCCA CCAACATGTG 
AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
ATTATTGAAA ATATCTTGGG AAAAGCCAGT 
AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
TATGAGGTTT ATGATGCAAA ATCAAAATCT 
CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
TGGGATATTG GAGTAATTGG AGCAATCCAG 
ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
TACTATGAAA AAGGAGAAAA ATGTCACTTT 
AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
TCCTGCAATG GAACATGGTC AGAAGATGTG 
TTTCCTGTGG ACAGAGCAAG CACATACTGT 
CAATTGGTGC TTCTGTTGCA AATTTTAATT 
AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
TTGTGTGATT GTTTCCTGGA TACTATCACC 
ATTTTATTAT TGAGTGGAAA AATCTTAATG 
CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
TGGGAAAACC AAAGATAATT AATAGTTTCA 
CACCAGAGTG ATGCAGGTTT ATATGTAATT 
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10 



2601 


GTGCCAGTAA 


2651 


ATCACACCAA 


2701 


AGAATTGTTC 


2751 


ATCTTTTTAT 


2801 


GAGCCTGAAA 


Zool 


TAAAGATGAG 


2901 


ATCTTGAAAA 


2951 


TTCTCTGAGG 


3001 


ACAACCCTTT 


3051 


AAACTGGTGA 



- 71 - 

TTATTTCCTC TTCCATCTTA 
AGAATGAAAA AGCTATTTTG 
CTGGGCACAA GGACTTAATT 
CAAGCATACA GCATCAGTGA 
CAATTTCAGA AGATATCAGT 
ATGATGCCAA CAACTGTGGT 
GGGTTCTGTT TGTTTTAGTG 
CTGAGGGTAC TGAGGTAACC 
GTTAAATACG CCACGCTGAT 
AGA 



TTGCTTGGAA CATTATTAAT 
GGAAGATGTT CCGAACCCCA 
TTCAGAAGAA ACGTTTGAGC 
CATGTGGTCC TCTTCTTTTG 
GTTGATACAT CATGGAAAAA 
CTCTCTACTT TCAACAACAG 
ACCAGTTCAA CAGTGTTAAC 
TATGAGGACG AAAGCCAGAG 
CAGCAACTCT AAACCAAGTG 
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Human OB Rpr-ppfnr "C " amino Acid Sequence (Seq. ID No. 5 (Amino 



1 


MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP 


NSTYDYFLLP 


51 


AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC 


FRSEQDRNCS 


101 

V A 


LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF 


ICYVESLFKN 


151 


LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV 


HECCECLVPV 


201 


PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP 


LGLHMEITDD 


251 


GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA 


TSLLVDSILP 


301 


GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI 


LTSVGSNVSF 


351 

» 


HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK 


VTFFNLNETK 


401 


PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG 


YLTKMTCRWS 


451 


TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL 


QSDGFYECIF 


501 


QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP 


SSVKAEITIK 


551 


t pt TifT qwpk PVFPENNLOF OIRYGLSGKE VOWKMYEVYD 


AKSKSVSLPV 


601 


PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM 


RGPEFWRIIN 


651 


GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT 


WSEDVGNHTK 


701 


FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI 


VQSLSAYPLN 


751 


SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS 


SVKKYYIHDH 


B01 


FIPIEKYQFS LYP IFMEGVG KPKIINSFTQ DDIEKHQSDA 


GLYVIVPVII 


651 


SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ 


KMLEGSMFVK 


901 


SHHHSLISST QGHKHCGRPQ GPLHRKTRDL CSLVYLLTLP 


PLLSYDPAKS 


951 


PSVRNTQE*S IKKKKKKLEG 
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Human OB Receptor *C« DNA Sequence (Sea. TP No. 6 fDNAl ) ; 



1 


CCGCCGCCAT 


CTCTGCCTTC 


51 


CTTCTCTGAA 


GTAAGATGAT 


101 


TTGGGAATTT 


ATTTATGTGA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


201 


TACTTCCTTT 


TGCCTGCTGG 


251 


ACATTATGAG 


ACAGCTGTTG 


301 


TTTCTAACTT 


ATCCAAAACA 


351 


GATAGAAACT 


GCTCCTTATG 


401 


TTCAACAGTA 


AATTCTTTAG 


451 


TACAGTGCTG 


GCTAAAAGGA 


501 , 


. TCATTATTTA 


AGAATCTATT 


551 


ATATGTTCTG 


CCTGAAGTGT 


601 


GCAGTTTTCA 


GATGGTTCAC 


651 


TGTCTTGTGC 


CTGTGCCAAC 


701 


TTTGAAAATC 


ACATCTGGTG 


751 


TTCAGCCCAT 


AAATATGGTG 


801 


GAAATCACAG 


ATGATGGTAA 


851 


GGTACCATTT 


CCACTTCAAT 


901 


CAGTTATCAG 


AGAAGCTGAC 


951 


GACAGTATAC 


TTCCTGGGTC 


1001 


ACTGGATGGC 


CCAGGAATCT 


1051 


CCACACAAGA 


TGTCATATAC 


1101 


TCTAATGTTT 


CTTTTCACTG 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


1201 


GCCAGTATGA 


TGTTGTGAGT 



GGTCGAGTTG GACCCCCGGA TCAAGGTGTA 
TTGTCAAAAA TTCTGTGTGG TTTTGTTACA 
TAACTGCGTT TAACTTGTCA TATCCAATTA 
TCTTGCATGC CACCAAATTC AACCTATGAC 
ACTCTCAAAG AATACTTCAA ATTCGAATGG 
AACCTAAGTT TAATTCAAGT GGTACTCACT 
ACTTTCCACT GTTGCTTTCG GAGTGAGCAA 
TGCAGACAAC ATTGAAGGAA AGACATTTGT 
TTTTTCAACA AATAGATGCA AACTGGAACA 
GACTTAAAAT TATTCATCTG TTATGTGGAG 
CAGGAATTAT AACTATAAGG TCCATCTTTT 
TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 
TGCAATTGCA GTGTTCATGA ATGTTGTGAA 
AGCCAAACTC AACGACACTC TCCTTATGTG 
GAGTAATTTT CCAGTCACCT CTAATGTCAG 
AAGCCTGATC CACCATTAGG TTTGCATATG 
TTTAAAGATT TCTTGGTCCA GCCCACCATT 
ATCAAGTGAA ATATTCAGAG AATTCTACAA 
AAGATTGTCT CAGCTACATC CCTGCTAGTA 
TTCGTATGAG GTTCAGGTGA GGGGCAAGAG 
GGAGTGACTG GAGTACTCCT CGTGTCTTTA 
TTTCCACCTA AAATTCTGAC AAGTGTTGGG 
CATCTATAAG AAGGAAAACA AGATTGTTCC 
GGATGAATTT AGCTGAGAAA ATTCCTCAAA 
GATCATGTTA GCAAAGTTAC TTTTTTCAAT 
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1251 CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG 
1301 CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 
1351 ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG 
1401 ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT 
1451 GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
1501 TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT 
1551 TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG 
1601 GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 
1651 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
1701 GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT 
1751 CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
1801 AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT 
1851 GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
1901 CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 
1951 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
2001 TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT 
2051 ACTTTGGAAG CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
2101 ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG 
2151 GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT 
2201 TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 
2251 TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
2301 GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC 
2351 CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 
2401 AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
2451 TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
2501 CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 
2551 CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT 
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2601 GTGCCAGTAA TTATTTCCTC TTCCATCTTA TTGCTTGGAA CATTATTAAT 

2651 ATCACACCAA AGAATGAAAA AGCTATTTTG GGAAGATGTT CCGAACCCCA 

2701 AGAATTGTTC CTGGGCACAA GGACTTAATT TTCAGAAGAT GCTTGAAGGC 

2751 AGCATGTTCG TTAAGAGTCA TCACCACTCC CTAATCTCAA GTACCCAGGG 

2801 ACACAAACAC TGCGGAAGGC CACAGGGTCC TCTGCATAGG AAAACCAGAG 

2851 ACCTTTGTTC ACTTGTTTAT CTGCTGACCC TCCCTCCACT ATTGTCCTAT 

2901 GACCCTGCCA AATCCCCCTC TGTGAGAAAC ACCCAAGAAT GATCAATAAA 

2951 AAAAAAAAAA AAAAAACTCG AGGGGG 
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Human OB RPPftPfor " H" Amino Rf^d SgQUfinCP fSftCTUftnCft ID NQ . 7) 

1 MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 

51 AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS 
101 LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF ICYVESLFKN 
151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 
201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 
251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 
301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 
351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 
401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 
451 TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 
501 QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 
551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 
601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRIIN 
651 GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 
701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 
751 SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS SVKKYYIHDH 
801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 
851 SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ KPETFEHLFI 
901 KHTASVTCGP LLLEPETISE DISVDTSWKN KDEMMPTTW SLLSTTDLEK 
951 GSVCISDQFN SVNFSEAEGT EVTYEDE5QR QPFVKYATLI SNSKPSETGE 
1001 EQGLINSSVT KCFSSKNSPL KDSFSNSSWE IEAQAFFILS DQHPNIISPH 
1051 LTFSEGLDEL LKLEGNFPEE NNDKKSIYYL GVTSIKKRES GVLLTDKSRV 
1101 SCPFPAPCLF TDIRVLQDSC SHFVENNINL GTSSKKTFAS YMPQFQTCST 
1151 QTHKIMENKM CDLTV*FH*R NLQICVIMGN IKCNRL*LWV GERKETRVKF 
1201 ENNCSK*KKK KKNSRPARPD 
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Human OB Receptor "D" Nucleic Acid Sequence f Sequence ID No.B) 



1 


GCGGCCGCCA 


GTGTGATGGA 


TATCTGCAGA ATTCGGCTTT 


CTCTGCCTTC 


51 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


CTTCTCTGAA 


GTAAGATGAT 


101 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


TTGGGAATTT 


ATTTATGTGA 


151 


TAACTGCGTT 


" TAACTTGTCA 


TATCCAATTA 


CTCCTTGGAG 


ATTTAAGTTG 


201 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


TACTTCCTTT 


TGCCTGCTGG 


251 


GCTCTCAAAG 


AATACTTCAA 


ATTCGAATGG ACATTATGAG 


ACAGCTGTTG 


301 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


TTTCTAACTT 


ATCCAAAACA 


351 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


GATAGAAACT 


GCTCCTTATG 


401 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


TTCAACAGTA 


AATTCTTTAG 


451 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


TACAGTGCTG 


GCTAAAAGGA 


501 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


TCATTATTTA 


AGAATCTATT 


551 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


ATATGTTCTG 


CCTGAAGTGT 


€01 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


GCAGTTTTCA 


GATGGTTCAC 


651 


TGCAATTGCA 


GTGTTCACGA 


ATGTTGTGAA 


TGTCTTGTGC 


CTGTGCCAAC 


701 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


TTTGAAAATC 


ACATCTGGTG 


751 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


TTCAGCCCAT 


AAATATGGTG 


801 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


GAAATCACAG 


ATGATGGTAA 


851 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


GGTACCATTT 


CCACTTCAAT 


901 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA CAGTTATCAG 


AGAAGCTGAC 


951 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA GACAGTATAC 


TTCCTGGGTC 


1001 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


ACTGGATGGC 


CCAGGAATCT 


1051 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


CCACACAAGA 


TGTCATATAC 


1101 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


TCTAATGTTT 


CTTTTCACTG 


1151 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


CTCAAAAGAG 


ATTGTTTGGT 


1201 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


GCCAGTATGA 


TGTTGTGAGT 


1251 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


CTGAATGAAA 


CCAAACCTCG 
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1301 AGGAAAGTTT ACCTATGATG 

1351 ATCATCGCTA TGCTGAATTA 

5 

1401 TGTGAAACTG ATGGGTACTT 

1451 TACAATCCAG TCACTTGCGG 

10 1501 GCAGCCTTTA CTGTTCTGAT 

1551 AAAGATTGCT ATTTGCAGAG 

1601 AATCTTCCTA TTATCTGGCT 

15 

1651 TAGGTTCACT TGACTCTCCA 

1701 AAGCCACTGC CTCCATCCAG 

20 1751 ATTATTGAAA ATATCTTGGG 

1801 AATTCCAGAT TCGCTATGGT 

1851 TATGAGGTTT ATGATGCAAA 

25 

1901 CTTGTGTGCA GTCTATGCTG 

1951 TGGGATATTG GAGTAATTGG 

30 2001 ATAAAAGTTC CTATGAGAGG 

2051 TACTATGAAA AAGGAGAAAA 

2101 AAAATGACTC ATTGTGCAGT 

35 

2151 TCCTGCAATG GAACATGGTC 

2201 TTTCCTGTGG ACAGAGCAAG 

40 2251 CAATTGGTGC TTCTGTTGCA 

2301 AGCAAAGTAA ATATCGTGCA 

2351 TTGTGTGATT GTTTCCTGGA 

45 

2401 ATTTTATTAT TGAGTGGAAA 

2451 CTTAGAATCT CTTCATCTGT 

50 2501 CCCCATTGAG AAGTACCAGT 

2551 TGGGAAAACC AAAGATAATT 



CAGTGTACTG CTGCAATGAA CATGAATGCC 
TATGTGATTG ATGTCAATAT CAATATCTCA 
AACTAAAATG ACTTGCAGAT GGTCAACCAG 
AAAGCACTTT GCAATTGAGG TATCATAGGA 
ATTCCATCTA TTCATCCCAT ATCTGAGCCC 
TGATGGTTTT TATGAATGCA TTTTCCAGCC 
ACACAATGTG GATTAGGATC AATCACTCTC 
CCAACATGTG TCCTTCCTGA TTCTGTGGTG 
TGTGAAAGCA GAAATTACTA TAAACATTGG 
AAAAGCCAGT CTTTCCAGAG AATAACCTTC 
TTAAGTGGAA AAGAAGTACA ATGGAAGATG 
ATCAAAATCT GTCAGTCTCC CAGTTCCAGA 
TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
AGCAATCCAG CCTACACAGT TGTCATGGAT 
ACCTGAATTT TGGAGAATAA TTAATGGAGA 
ATGTCACTTT ACTTTGGAAG CCCCTGATGA 
GTTCAGAGAT ATGTGATAAA CCATCATACT 
AGAAGATGTG GGAAATCACA CGAAATTCAC 
CACATACTGT TACGGTTCTG GCCATCAATT 
AATTTTAATT TAACCTTTTC ATGGCCTATG 
GTCACTCAGT GCTTATCCTT TAAACAGCAG 
TACTATCACC CAGTGATTAC AAGCTAATGT 
AATCTTAATG AAGATGGTGA AATAAAATGG 
TAAGAAGTAT TATATCCATG ATCATTTTAT 
TCAGTCTTTA CCCAATATTT ATGGAAGGAG 
AATAGTTTCA CTCAAGATGA TATTGAAAAA 
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2601 
2651 
5 2701 
2751 
2801 

10 

2851 
2901 
15 2951 
3001 
3051 

20 

3101 
3151 
25 3201 
3251 
3301 

30 

3351 
3401 
35 3451 
3501 
3551 



CACCAGAGTG 
TTCCATCTTA 
AGCTATTTTG 
GGACTTAATT 
TACAGCATCA 
CAGAAGATAT 
CCAACAACTG 
TGTTTGTATT 
GTACTGAGGT 
TACGCCACGC 
AGGGCTTATA 
CGTTGAAGGA 
TTTTTTATAT 
ATTCTCAGAA 
AAGAAAATAA 
AAAAAGAGAG 
CCCATTCCCA 
GTTGCTCACA 
AAGACTTTTG 
TCATAAGATC 



ATGCAGGTTT 
TTGCTTGGAA 
GGAAGATGTT 
TTCAGAAGCC 
GTGACATGTG 
CAGTGTTGAT 
TGGTCTCTCT 
AGTGACCAGT 
AACCTATGAG 
TGATCAGCAA 
AATAGTTCAG 
TTCTTTCTCT 
TATCGGATCA 
GGATTGGATG 
TGATAAAAAG 
AGAGTGGTGT 
GCCCCCTGTT 
CTTTGTAGAA 
CATCTTACAT 
ATGGAAAACA 
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ATATGTAATT 
CATTATTAAT 
CCGAACCCCA 
AGAAACGTTT 
GTCCTCTTCT 
ACATCATGGA 
ACTTTCAACA 
TCAACAGTGT 
GACGAAAGCC 
CTCTAAACCA 
TCACCAAGTG 
AATAGCTCAT 
GCATCCCAAC 
AACTTTTGAA 
TCTATCTATT 
GCTTTTGACT 
TATTCACGGA 
AATAATATCA 
GCCTCAATTC 
AGATGTGTGA 



GTGCCAGTAA 
ATCACACCAA 
AGAATTGTTC 
GAGCATCTTT 
TTTGGAGCCT 
AAAATAAAGA 
ACAGATCTTG 
TAACTTCTCT 
AGAGACAACC 
AGTGAAACTG 
CTTCTCTAGC 
GGGAGATAGA 
ATAATTTCAC 
ATTGGAGGGA 
ATTTAGGGGT 
GACAAGTCAA 
CATCAGAGTT 
ACTTAGGAAC 
CAAACTTGTT 
CCTAACTGTG 



TTATTTCCTC 
AGAATGAAAA 
CTGGGCACAA 
TTATCAAGCA 
GAAACAATTT 
TGAGATGATG 
AAAAGGGTTC 
GAGGCTGAGG 
CTTTGTTAAA 
GTGAAGAACA 
AAAAATTCTC 
GGCCCAGGCA 
CACACCTCAC 
AATTTCCCTG 
CACCTCAATC 
GGGTATCGTG 
CTCCAGGACA 
TTCTAGTAAG 
CTACTCAGAC 
TAATCTAGA 
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Hnman OB Rprpntftr PrntPin "D " Chromosomal DNA (SPCf. TP NO. 9) 



Intron 1 



.taecttttccag 



GTS TAC TIC 



CAT TOG G gtaagttatttg. 
His Tip Glu 
12 13 14 



Intron 2 



.atatecteacag AA TTT ATT 

Phe lie 

15 16 



CM ATA G gtaagcattagc, 
Gin He Asp 
122 123 124 



Intron 3 



.ttttaaattcag 



AT GCA AAC 
Ala Aan 
125 126 



TAT GTT CT gtaagtaceaaa. 
Tyr Val Leu 
163 164 165 



Intron 4 



■ ttwcaatatag 



G OCT GAA 
Pro GlU 
166 167 



AAT ATG G gtaagttatgca , 
Aan Mat Val 
233 234 235 



Intron 5 



.tttttcettaag 



TG AAG CCT 
Lys Pro 
236 237 



ATC AGA GAA gtaagtatattt , 
He Arg Glu 
281 262 283 



Intron 6 



.aatatttaacag 



GCT GAC AAG 
Ala Asp Lys 
284 285 286 



ACA CAA G gtaggttatgta. 
Thr Gin Asp 
330 331 332 



Intron 7 



.ccctcattacag 



AT GTC ATA 
Val lift 
333 334 



GTG ATT G gtaagaaaacag . 
Val He Asp 
427 428 429 



Intron 8 



.tgtttcaaatag 



AT GTC AAT 
Val Aan 
430 431 



TAT CAT AG gtacgtattatt, 
Tyr His Arg 
466 467 468 



Intron 9 



.tatcttttaaag 



G AGC AGC 
Ser Ser 
469 470 



TCT GTG G gtatgtcaagct , 
Ser Val Val 
533 534 535 



Intron 10 



.aaaaatttctag 



TG AAG CCA 
Lys Pro 

536 537 



CAA TGG AAG gtacettttaet . 
Gin Tip Lya 
582 583 584 



Intron 11 



.cttattttacag 



ATG TAT GAG 
Met Tyr Glu 
585 586 587 



ATA AAA G gtctgcagagat . 
He Lys Val 
636 637 638 



Intron 12 



.gtcattttgcag 



TT CCT ATG 
Pro Met 
639 640 
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CTT TGG AAG gtattcccaatt , 
Leu Trp Lys 
663 664 665 



Intron 13 



.tatttactaeag 



COC CTG ATS 
Pro Leu Met 
666 667 668 



AGC AAA G gtaagaagaggt. 
Ser Lys Val 
10 736 737 738 



Intron 14 



.ttttcccctcag 



TA AAT ATC 
Asn He 
739 740 



ATC CAT G gtAagtttacta. 
Ha His Asp 
15 797 798 799 



Intron 15 



.ttttctccteag 



AT CAT TTT 
His Pho 
600 801 



ACT CAA G gta&aaattata. 
Thx Gin Asp 
20 829 830 831 



Intron 16 



.tttctttttcag 



AT GAT ATT 
Asp II* 

832 833 



25 



30 



35 



40 



45 



CAC 


CAA 




Intron 17 




A 


ATG 


AAA 


His 


Gin 


Arg 








Mat 


Lys 


864 


865 


866 








867 


868 














Exon A 


TTT 


CAG 




Intron 18 




AGA 


AOG 


GAC 


Phe 


Gin 


Lys 






Arg 


Thr 


Asp 


889 


890 


891 






892 


893 


894 




Exon A 






Exon 


D 




AAA 


TAT 




Intron IB 




CCA 


GAA 


AOG 












Pro 


Glu 


Thr 












892 


893 


894 












Exon 


B 














AAA 


CGT 


TTG 












Lys 


Arg 


Leu 












692 


893 


894 




Exon D 






Exon C 




GAA 


ACC 




Intron 18 




ATG 


CTT 


GAA 












Met 


Leu 


Glu 












892 


893 


894 
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flrnnan OB R^rpptor Prntftin, KPromhinant Secreted ReCftPt.Or amino 
^fliiPnpp fSPQ. TP. NO . 1Q1 : 

1 MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 

51 AGX*SKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS 

101 LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF ICYVESLFKN 

151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 

201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 

251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 

301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 

351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 

401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 

451 TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 

501 QPIFLLSGYT MWIRINH5LG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 

551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 

601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRI IN 

651 GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 

701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 

751 SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS SVKKYYIHDH 

801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSD 
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Human OB Receptor P rotPln. Recombinant Secreted Receptor DNA 
gpgiienrp ffipo. TP. No. 11): 



1 


GCGGCCGCCA 

W W w wW w\a£b 


GTGTGATGGA 


TATCTGCAGA 


ATTCGGCTTT 


CTCTGCCTTC 




rime g agttg 


GACCCCCGGA 


TCAAGGTGTA 


CTTCTCTGAA 

A A A Aj A 


GTAAGATGAT 

A a A b^^A ■ ^VA A A 




TTGTCAAAAA 


TTCTGTGTGG 

a * a a. «j a a^ 


TTTTGTTACA 

Aj AAA A A m A^*#> • 


TTGGGAATTT 

A A ^# P» A A A A 


ATTTATGTGA 


151 

At V 4* 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


CTCCTTGGAG 


ATTTAAGTTG 


201 

A* W J* 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


TACTTCCTTT 


TGCCTGCTGG 


251 


GCTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


ACATTATGAG 


ACAGCTGTTG 


301 

v V A 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


TTTCTAACTT 


ATCCAAAACA 


351 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


GATAGAAACT 


GCTCCTTATG 


401 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


TTCAACAGTA 


AATTCTTTAG 


451 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


TACAGTGCTG 


GCTAAAAGGA 


501 


GACTTAAAAT 

^b|PA B^tfP A A * ^A 4* • A 


TATTCATCTG 


TTATGTGGAG 


TCATTATTTA 


AGAATCTATT 


551 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


ATATGTTCTG 


CCTGAAGTGT 


601 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


GCAGTTTTCA 


GATGGTTCAC 


651 


TGCAATTGCA 

* ^|PA * A A ^A * 


GTGTTCACGA 


ATGTTGTGAA 


TGTCTTGTGC 


CTGTGCCAAC 


701 

r V a* 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 

A ^■r^i^ A A • » A A 


TTTGAAAATC 

A A Aj W* ** ** ** A A 


ACATCTGGTG 


751 

r a/ a> 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


TTCAGCCCAT 


AAATATGGTG 


801 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


GAAATCACAG 


ATGATGGTAA 


851 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


GGTACCATTT 


CCACTTCAAT 


901 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


CAGTTATCAG 


AGAAGCTGAC 


951 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


GACAGTATAC 


TTCCTGGGTC 


1001 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


ACTGGATGGC 


CCAGGAATCT 


1051 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


CCACACAAGA 


TGTCATATAC 


1101 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


TCTAATGTTT 


CTTTTCACTG 


1151 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


CTCAAAAGAG 


ATTGTTTGGT 


1201 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


GCCAGTATGA 


TGTTGTGAGT 
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1251 GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG 
1301 AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 
1351 ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA 
1401 TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG 
1451 TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 
1501 GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC 
1551 AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC 
1601 AATCTTCCTA TTATCTGGCT ACACAATGTG GATTAGGATC AATCACTCTC 
1651 TAGGTTCACT TGACTCTCCA CCAACATGTG TCCTTCCTGA TTCTGTGGTG 
1701 AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA TAAACATTGG 
1751 ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACC TTC 
1801 AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG 
1851 TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA 
1901 CTTGTGTGCA GTCTATGCTG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
1951 TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT 
2001 ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA 
2051 TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 
2101 AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT 
2151 TCCTGCAATG GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC 
2201 TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 
2251 CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG 
2301 AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG 
2351 TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 
2401 ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG 
2451 CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT 
2501 CCCCATTGAG AAGTACCAGT TCAGTCTTTA CCCAATATTT ATGGAAGGAG 
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2551 TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA 
2601 CACCAGAGTG ATTGATAAGG ATCC 
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fflim^n fm p ^^pfnr PrntP^n. Rpcnmm nant Sprrf>tf>ri RftfifiPt.Qr DMA 

^TionrP v \th r-t-prmHnal FT.&ft fKPO. ID. No . 12) : 

1 CCATTGAAGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGGATTT 

51 CCAAAATGTC GTAATAACCC CGCCCCGTTG ACGCAAATGG GCGGTAGGCG 
101 TGTACGGTGG GAGGTCTATA TAAGCAGAGC TCGTTTAGTG AACCGTCAGA 
151 TCTCTAGAAG CTGGGTACCA GCTGCTAGCA AGCTTGCTAG CGGCCGCCAG 
201 TGTGATGGAT ATCTGCAGAA TTCGGCTTTC TCTGCCTTCG GTCGAGTTGG 
251 ACCCCCGGAT CAAGGTGTAC TTCTCTGAAG TAAGATGATT TGTCAAAAAT 
301 TCTGTGTGGT TTTGTTACAT TGGGAATTTA TTTATGTGAT AACTGCGTTT 
351 AACTTGTCAT ATCCAATTAC TCCTTGGAGA TTTAAGTTGT CTTGCATGCC 
401 ACCAAATTCA ACCTATGACT ACTTCCTTTT GCCTGCTGGG CTCTCAAAGA 
451 ATACTTCAAA TTCGAATGGA CATTATGAGA CAGCTGTTGA ACCTAAGTTT 
501 AATTCAAGTG GTACTCACTT TTCTAACTTA TCCAAAACAA CTTTCCACTG 
551 TTGCTTTCGG AGTGAGCAAG ATAGAAACTG CTCCTTATGT GCAGACAACA 
601 TTGAAGGAAA GACATTTGTT TCAACAGTAA ATTCTTTAGT TTTTCAACAA 
651 ATAGATGCAA ACTGGAACAT ACAGTGCTGG CTAAAAGGAG ACTTAAAATT 
701 ATTCATCTGT TATGTGGAGT CATTATTTAA GAATCTATTC AGGAATTATA 
751 ACTATAAGGT CCATCTTTTA TATGTTCTGC CTGAAGTGTT AGAAGATTCA 
801 CCTCTGGTTC CCCAAAAAGG CAGTTTTCAG ATGGTTCACT GCAATTGCAG 
851 TGTTCACGAA TGTTGTGAAT GTCTTGTGCC TGTGCCAACA GCCAAACTCA 
901. ACGACACTCT CCTTATGTGT TTGAAAATCA CATCTGGTGG AGTAATTTTC 
951 CAGTCACCTC TAATGTCAGT TCAGCCCATA AATATGGTGA AGCCTGATCC 
1001 ACCATTAGGT TTGCATATGG AAATCACAGA TGATGGTAAT TTAAAGATTT 
1051 CTTGGTCCAG CCCACCATTG GTACCATTTC CACTTCAATA TCAAGTGAAA 
1101 TATTCAGAGA ATTCTACAAC AGTTATCAGA GAAGCTGACA AGATTGTCTC 
1151 AGCTACATCC CTGCTAGTAG ACAGTATACT TCCTGGGTCT TCGTATGAGG 
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15 



20 



25 



30 



35 



40 



45 



50 







— 
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1201 


TTCAGGTGAG 


GGGCAAGAGA 


CTGGATGGCC 


CAGGAATCTG 


GAGTGACTGG 


1251 


AGTACTCCTC 


GTGTCTTTAC 


CACACAAGAT 


GTCATATACT 


TTCCACCTAA 


1301 


AATTCTGACA 


AGTGTTGGGT 


CTAATGTTTC 


TTTTCACTGC 


ATCTATAAGA 


1351 


AGGAAAACAA 


GATTGTTCCC 


TCAAAAGAGA 


TTGTTTGGTG 


GATGAATTTA 


1401 


GCTGAGAAAA 


TTCCTCAAAG 


CCAGTATGAT 


GTTGTGAGTG 


ATCATGTTAG 


1451 


CAAAGTTACT 


TTTTTCAATC 


TGAATGAAAC 


CAAACCTCGA 


GGAAAGTTTA 


1501 


CCTATGATGC 


AGTGTACTGC 


TGGAATGAAC 


ATGAATGCCA 


TCATCGCTAT 


1551 


GCTGAATTAT 


ATGTGATTGA 


TGTCAATATC 


AATATCTCAT 


GTGAAACTGA 


1601 


TGGGTACTTA 


ACTAAAATGA 


CTTGCAGATG 


GTCAACCAGT 


ACAATCCAGT 


1651 


CACTTGCGGA 


AAGCACTTTG 


CAATTGAGGT 


ATCATAGGAG 


CAGCCTTTAC 


1701 


TGTTCTGATA 


TTCCATCTAT 


TCATCCCATA 


TCTGAGCCCA 


AAGATTGCTA 


1751 


TTTGCAGAGT 


GATGGTTTTT 


ATGAATGCAT 


TTTCCAGCCA 


ATCTTCCTAT 


1B01 


TATCTGGCTA 


CACAATGTGG 


ATTAGGATCA 


ATCACTCTCT 


AGGTTCACTT 


1651 


GACTCTCCAC 


CAACATGTGT 


CCTTCCTGAT 


TCTGTGGTGA 


AGCCACTGCC 


1901 


TCCATCCAGT 


GTGAAAGCAG 


AAATTACTAT 


AAACATTGGA 


TTATTGAAAA 


1951 


TATCTTGGGA 


AAAGCCAGTC 


TTTCCAGAGA 


ATAACCTTCA 


ATTCCAGATT 


2001 


CGCTATGGTT 


TAAGTGGAAA 


AGAAGTACAA 


TGGAAGATGT 


ATGAGGTTTA 


2051 


TGATGCAAAA 


TCAAAATCTG 


TCAGTCTCCC 


AGTTCCAGAC 


TTGTGTGCAG 


2101 


TCTATGCTGT 


TCAGGTGCGC 


TGTAAGAGGC 


TAGATGGACT 


GGGATATTGG 


2151 


AGTAATTGGA 


GCAATCCAGC 


CTACACAGTT 


GTCATGGATA 


TAAAAGTTCC 


2201 


TATGAGAGGA 


CCTGAATTTT 


GGAGAATAAT 


TAATGGAGAT 


ACTATGAAAA 


2251 


AGGAGAAAAA 


TGTCACTTTA 


CTTTGGAAGC 


CCCTGATGAA 


AAATGACTCA 


2301 


TTGTGCAGTG 


TTCAGAGATA 


TGTGATAAAC 


CATCATACTT 


CCTGCAATGG 


2351 


AACATGGTCA 


GAAGATGTGG 


GAAATCACAC 


GAAATTCACT 


TTCCTGTGGA 


2401 


CAGAGCAAGC 


ACATACTGTT 


ACGGTTCTGG 


CCATCAATTC 


AATTGGTGCT 


2451 


TCTGTTGCAA 


ATTTTAATTT 


AACCTTTTCA 


TGGCCTATGA 


GCAAAGTAAA 


2501 


TATCGTGCAG 


TCACTCAGTG 


CTTATCCTTT 


AAACAGCAGT 


TGTGTGATTG 
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2551 TTTCCTGGAT ACTATCACCC AGTGATTACA AGCTAATGTA TTTTATTATT 

2601 GAGTGGAAAA ATCTTAATGA AGATGGTGAA ATAAAATGGC TTAGAATCTC 

2651 TTCATCTGTT AAGAAGTATT ATATCCATGA TCATTTTATC CCCATTGAGA 

2701 AGTACCAGTT CAGTCTTTAC CCAATATTTA TGGAAGGAGT GGGAAAACCA 

2751 AAGATAATTA ATAGTTTCAC TCAAGATGAT ATTGAAAAAC ACCAGAGTGA 

2801 TGCAGGTGAC TACAAGGACG ACGATGACAA GTAGGGATCC AGACATGATA 

2851 AGATACATTG ATGAGTTTGG ACAACCCACA ACTAGAATGC AGTGAAAAAA 

2901 ATGCTTTATT TGTGAAATTT GTGATGCTAT TGCTTTATTT GTAACCAT 
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Rgrnmbinant Human OB Receptor Protein. Natural Splire Variant 
amlno acid sequent (Sea. ID. No. 13) 



5 


1 


MICQKFCWL 


LHWEFIYVIT 


AFNLSYPITP 


WRFKLSCMPP 


NSTYDYFLLP 




51 . 


AGLSKNTSNS 


NGHYETAVEP 


KFNSSGTHFS 


NLSKTTFHCC 


FRSEQDRNCS 


10 


101 


LCADNIEGKT 


FVSTVNSLVF 


QQIDANWNIQ 


CWLKGDLKLF 


ICYVESLFKN 


151 


LFRNYNYKVH 


LLYVLPEVLE 


DSPLVPQKGS 


FQMVHCNCSV 


HECCECLVPV 




201 


PTAKLNDTLL 


MCLKITSGGV 


IFQSPLMSVQ PINMVKPDPP 


LGLHMEITDD 


15 


251 


GNLKISWSSP 


PLVPFPLQYQ 


VKYSENSTTV 


IREADKIVSA 


TSLLVDSILP 




301 


GSSYEVQVRG 


KRLDGPGIWS 


DWSTPRVFTT 


QDVIYFPPKI 


LTSVGSNVSF 


20 


351 


HCIYKKENKI 


VPSKEIVWWM NLAEKIPQSQ 


YDWSDHVSK 


VTFFNLNETK 


401 


PRGKFTYDAV 


YCCNEHECHH 


RYAELYVIDV 


NINISCETDG 


YLTKMTCRWS 




451 


TSTIQSLAES 


TLQLRYHRSS 


LYCSDIPSIH 


PISEPKDCYL 


QSDGFYECIF 


25 


501 


QPIFLLSGYT 


MWIRINHSLG 


SLDSPPTCVL 


PDSWKPLPP 






551 


IGLLKISWEK 


PVFPENNLQF 


QIRYGLSGKE 


VQWKMYEVYD 


AKSKSVSLPV 


30 


601 


PDLCAVYAVQ VRCKRLDGLG 


YWSNWSNPAY 


TWMDIKVPM 


RGPEFWRI IN 


651 


GDTMKKEKNV 


TLLWKPLMKN 


DSLCSVQRYV 


INHHTSCNGT 


WSEDVGNHTK 




701 


FTFLWTEQAH 


TVTVLAINSI 


GASVANFNLT 


FSWPMSKVNI 


VQSLSAYPLN 


35 


751 
801 


SSCVIVSWIL 
FTIL 


SPSDYKLMYF 


IIEWKNLNED 


GEIKWLRISS 


SVKKYYIHGK 
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Hnffl ft" fi R Rp^nr p r n*-oin. Natural Snlirp Variant DNA (Sftq. ID. 
NO. 14) 

1 GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC 

51 GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT 
101 TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 
151 TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG 
201 TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG 
251 GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 
301 AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA 
351 ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG 
401 TGCAGACAAC ATTGAAGGAA AGACATTTGT TTCAACAGTA AATTCTTTAG 
451 TTTTTCAACA AATAGATGCA AACTGGAACA TACAGTGCTG GCTAAAAGGA 
501 GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT 
551 CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 
601 TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC 
651 TGCAATTGCA GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC 
701 AGCCAAACTC AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 
751 GAGTAATTTT CCAGTCACCT CTAATGTCAG TTCAGCCCAT AAATATGGTG 
801 AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA 
851 TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 
901 ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC 
951 AAGATTGTCT CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC 
1001 TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 
1051 GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC 
1101 TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG 
1151 CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 
1201 GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT 
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1251 GATCATGTTA 

1301 AGGAAAGTTT 

5 1351 ATCATCGCTA 

1401 TGTGAAACTG 

1451 TACAATCCAG 

10 

1501 GCAGCCTTTA 

1551 AAAGATTGCT 

15 1601 AATCTTCCTA 

1651 TAGGTTCACT 

1701 AAGCCACTGC 

20 

1751 ATTATTGAAA 

1801 AATTCCAGAT 

25 1851 TATGAGGTTT 

1901 CTTGTGTGCA 

1951 TGGGATATTG 

30 

2001 ATAAAAGTTC 

2051 TACTATGAAA 

35 2101 AAAATGACTC 

2151 TCCTGCAATG 

2201 TTTCCTGTGG 

40 

2251 CAATTGGTGC 

2301 AGCAAAGTAA 

45 2351 TTGTGTGATT 

2401 ATTTTATTAT 

2451 CTTAGAATCT 

50 

2501 TATACTT 



- 91 - 

GCAAAGTTAC TTTTTTCAAT 
ACCTATGATG CAGTGTACTG 
TGCTGAATTA TATGTGATTG 
ATGGGTACTT AACTAAAATG 
TCACTTGCGG AAAGCACTTT 
CTGTTCTGAT ATTCCATCTA 
ATTTGCAGAG TGATGGTTTT 
TTATCTGGCT ACACAATGTG 
TGACTCTCCA CCAACATGTG 
CTCCATCCAG TGTGAAAGCA 
ATATCTTGGG AAAAGCCAGT 
TCGCTATGGT TTAAGTGGAA 
ATGATGCAAA ATCAAAATCT 
GTCTATGCTG TTCAGGTGCG 
GAGTAATTGG AGCAATCCAG 
CTATGAGAGG ACCTGAATTT 
AAGGAGAAAA ATGTCACTTT 
ATTGTGCAGT GTTCAGAGAT 
GAACATGGTC AGAAGATGTG 
ACAGAGCAAG CACATACTGT 
TTCTGTTGCA AATTTTAATT 
ATATCGTGCA GTCACTCAGT 
GTTTCCTGGA TACTATCACC 
TGAGTGGAAA AATCTTAATG 
CTTCATCTGT TAAGAAGTAT 



CTGAATGAAA CCAAACCTCG 
CTGCAATGAA CATGAATGCC 
ATGTCAATAT CAATATCTCA 
ACTTGCAGAT GGTCAACCAG 
GCAATTGAGG TATCATAGGA 
TTCATCCCAT ATCTGAGCCC 
TATGAATGCA TTTTCCAGCC 
GATTAGGATC AATCACTCTC 
TCCTTCCTGA TTCTGTGGTG 
GAAATTACTA TAAACATTGG 
CTTTCCAGAG AATAACCTTC 
AAGAAGTACA ATGGAAGATG 
GTCAGTCTCC CAGTTCCAGA 
CTGTAAGAGG CTAGATGGAC 
CCTACACAGT TGTCATGGAT 
TGGAGAATAA TTAATGGAGA 
ACTTTGGAAG CCCCTGATGA 
ATGTGATAAA CCATCATACT 
GGAAATCACA CGAAATTCAC 
TACGGTTCTG GCCATCAATT 
TAACCTTTTC ATGGCCTATG 
GCTTATCCTT TAAACAGCAG 
CAGTGATTAC AAGCTAATGT 
AAGATGGTGA AATAAAATGG 
TATATCCATG GTAAGTTTAC 
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While the present invention has been described 
in terms of preferred embodiments, it is understood that 
variations and modifications will occur to those skilled 
in the art. Therefore, it is intended that the appended 
5 claims cover all such equivalent variations which come 
within the scope of the invention as claimed. 
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15 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: CHANG, MING-SHI 

WELCHER, ANDREW A. 
FLETCHER, FREDERICK A. 

(ii) TITLE OF INVENTION: OB PROTEIN RECEPTOR AND RELATED 
COMPOSITIONS AND METHODS 

(1U) NUMBER OF SEQUENCES: 33 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arogen Inc. 

(B) STREET : 1640 Dehavilland Drive 

(C) CITY: Thousand Oaks 
20 (D) STATE: California 

<E) COUNTRY: USA 

(F) ZIP: 91320 

(V) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE : Patent In Release #1.0, Version #1.30 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

35 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Pessin, Karol M. 
(C) REFERENCE /DOCKET NUMBER: A-382-A 



40 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 amino acids 

(B) TYPE: amino acid 

45 tC) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
1 5 10 15 

Tyr Val He Thr Ala Phe Aan Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lya Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Aap Tyr Phe Leu 
35 40 «5 

Leu Pro Ala Gly Leu Ser Lya Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 *0 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 ' 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 1«5 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 \ 270 

\ 

\ 

\ 
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Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 
275 260 2BS 

Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly lie Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 

325 330 335 



15 



Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 



lie Tyr Lys Lys Glu Asn Lys lie Val Pro Ser Lys Glu lie Val Trp 
355 360 365 



20 



Trp Met Asn Leu Ala Glu Lys lie Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 



25 



Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 



30 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn lie 
420 425 430 



Asn lie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 



35 



Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 



40 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 

465 470 475 460 

Pro Zle Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 



45 



Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 



He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 



50 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 
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Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Ty.r Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
€60 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 

805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 
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lie Glu Lys His Gin Sex Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
850 855 B60 

Gin Axg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Arg Thr Asp He Leu 

885 890 895 



Ser Leu He Met lie Thr Thr Asp Glu Pro Asn Val Pro Thr Ser Gin 
15 900 905 910 

Gin Ser He Glu Tyr Lys He Phe Thr Phe Arg Arg Gly Ala Asn Leu 
915 920 925 

20 Lys Lys He Gin Leu Asn Phe Glu Leu Thr Tyr Gly Gly Leu Cys Phe 

930 935 940 



Arg Thr Asn Arg Cys Val Asn Leu Gly Ser Lys Cys Arg Phe Glu Ser 
945 950 955 960 

Ser Leu Asp Val Leu 

965 

(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3193 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA 60 

GTAAGATGAT TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 120 

TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG TCTTGCATGC 180 

50 CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA 240 

ATTCGAATGG ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGTACTCACT 300 
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TTTCTAACTT ATCCAAAACA ACTTTCCACT 
GCTCCTTATG TGCAGACAAC ATTGAAGGAA 
5 TTTTTCAACA AATAGATGCA AACTGGAACA 
TATTCATCTG TTATGTGGAG TCATTATTTA 
TCCATCTTTT ATATGTTCTG CCTGAAGTGT 

10 

GCAGTTTTCA GATGGTTCAC TGCAATTGCA 
CTGTGCCAAC AGCCAAACTC AACGACACTC 
15 GAGIAATTTT CCAGTCACCT CTAATGTCAG 
CACCATTAGG TTTGCATATG GAAATCACAG 
GCCCACCATT GGTACCATTT CCACTTCAAT 

20 

CAGTTATCAG AGAAGCTGAC AAGATTGTCT 
TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 
25 GGAGTGACTG GAGTACTCCT CGTGTCTTTA 
AAATTCTGAC AAGTGTTGGG TCTAATGTTT 
AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 

30 

GCCAGTATGA TGTTGTGAGT GATCATGTTA 
CCAAACCTCG AGGAAAGTTT ACCTATGATG 
35 ATCATCGCTA TGCTGAATTA TATGTGATTG 
ATGGGTACTT AACTAAAATG ACTTGCAGAT 
AAAGCACTTT GCAATTGAGG TAT CAT AG GA 

40 

TTCATCCCAT ATCTGAGCCC AAAGATTGCT 
TTTTCCAGCC AATCTTCCTA TTATCTGGCT 
45 TAGGTTCACT TGACTCTCCA CCAACATGTG 
CTCCATCCAG TGTGAAAGCA GAAATTACTA 
AAAAGCCAGT CTTTCCAGAG AATAACCTTC 

50 

AAGAAGTACA ATGGAAGATG TATGAGGTTT 
CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 
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GTTGCTTTCG GAGTGAGCAA GATAGAAACT 360 

AGACATTTGT TTCAACAGTA AATTCTTTAG 420 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT 480 

AGAATCTATT CAGGAATTAT AACTATAAGG 540 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 600 

GTGTTCATGA ATGTTGTGAA TGTCTTGTGC 660 

TCCTTATGTG TTTGAAAATC ACATCTGGTG 720 

TTCAGCCCAT AAATATGGTG AAGCCTGATC 780 

ATGATGGTAA 7TTAAAGATT TCTTGGTCCA 840 

ATCAAGTGAA AT AT TC AG AG AATTCTACAA 900 

CAGCTACATC CCTGCTAGTA GACAGTATAC 960 

GGGGCAAGAG ACTGGATGGC CCAGGAATCT 1020 

CCACACAAGA TGTCATATAC TTTCCACCTA 1080 

CTTTTCACTG CATCTATAAG AAGGAAAACA 1140 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA 1200 

GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 1260 

CAGTGTACTG CTGCAATGAA CATGAATGCC 1320 

ATGTCAATAT CAATATCTCA TGTGAAACTG 1380 

GGTCAACCAG TACAATCCAG TCACTTGCGG 1440 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 1500 

ATTTGCAGAG TGATGGTTTT TATGAATGCA 1560 

ACACAATGTG GATTAGGATC AATCACTCTC 1620 

TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 1680 

TAAACATTGG ATTATTGAAA ATATCTTGGG 1740 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA 1B00 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC 1860 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 1920 
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TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT 
CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA 

5 

ATGTCACTTT ACTTTGGAAG CCCCTGATGA AAAATGACTC 
ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC 
10 CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT 
CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC 
ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG 

15 

TACTATCACC CAGTGATTAC AAGCTAATGT ATTTTATTAT 
AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT 
20 ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA 
ATGCAGGTTT ATATGTAATT GTGCCAGTAA TTATTTCCTC 

25 

CATTATTAAT ATCACACCAA AGAATGAAAA AGCTATTTTG 
AGAATTGTTC CTGGGCACAA GGACTTAATT TTCAGAAGAG 
30 TAATCATGAT CACTACAGAT GAACCCAATG TGCCAACTTC 
AGAAGATTTT TACATTTTGA AGAAGGGGAG CAAATCTAAA 
GAGAGTTAAC ATATGGTGGA TTATGTTGAT TTAGAACTTA 

35 

GTTCAAAATG TAGATTTGAG TCCAGTTTGG ATGTGTGATT 
TTTAAAAGTA GTATTCATGA TTTCTGGCTT TTGATTTGCC 
40 TTAAGAAAAT TATGGCTGTT GCTGTCATTA CATATCTATT 
TAGACAATTT TGTAATTAGG TGAACTCTAA AACTGCAACA 
ATACAATGAT TAT 

45 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 
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TGTCATGGAT ATAAAAGTTC 1980 

TACTATGAAA AAGGAGAAAA 2040 

ATTGTGCAGT GTTCAGAGAT 2100 

AGAAGATGTG GGAAATCACA 2160 

TACGGTTCTG GCCATCAATT 2220 

ATGGCCTATG AGCAAAGTAA 22 BO 

TTGTGTGATT GTTTCCTGGA 2340 

TGAGTGGAAA AATCTTAATG 2400 

TAAGAAGTAT TATATCCATG 2460 

CCCAATATTT ATGGAAGGAG 2520 

TATTGAAAAA CACCAGAGTG 2580 

TTCCATCTTA TTGCTTGGAA 2640 

GGAAGATGTT CCGAACCCCA 2700 

AACGGACATT CTTTGAAGTC 2760 

CCAACAGTCT AT AG AG TATT 2820 

AAAAATTCAG TTGAACTTCT 2680 

AAATAGATGT GTAAATTTGG 2940 

AATTTTCAAA TCATCTAAAG 30 00 

ATATTCCTGG TCATAAAACA 3060 

AAATGTCATC AAATATGTAG 3120 

TCTGACAAAT TGCTTTAAAA 3180 

3193 
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Ul) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
1 5 10 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
SO 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lya Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 



\ 
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Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 

24S 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 265 



10 



Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 



15 



Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 

305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 

325 330 335 



20 



Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 



He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 ' 365 



25 



Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 



30 



Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 



35 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 



Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 



40 



Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 



45 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 

465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 



50 



Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 



He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 
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Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu lie Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
5B0 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 600 
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Phe lie Pro lie Glu Lys Tyr Gin Phe Ser Leu Tyr Pro lie Phe Met 

805 810 815 

Glu Gly Val Gly Lys Pro Lys lie lie Asn Ser Phe Thr Gin Asp Asp 
5 820 B25 830 

lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

10 He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 

850 955 860 



15 



30 



Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Lys Arg Leu Ser He 

885 890 895 



Phe Leu Ser Ser He Gin His Gin His Val Val Leu Phe Phe Trp Ser 
20 900 905 910 

Leu Lys Gin Phe Gin Lys He Ser Val Leu He His His Gly Lys He 
915 920 925 

25 Lys Met Arg Cys Gin Gin Leu Trp Ser Leu Tyr Phe Gin Gin Gin He 

930 935 940 



Leu Lys Arg Val Leu Phe Val Leu Val Thr Ser Ser Thr Val Leu Thr 
945 950 955 960 

Ser Leu Arg Leu Arg Val Leu Arg Pro Met Arg Thr Lys Ala Arg Asp 

965 970 975 



Asn Pro Leu Leu Asn Thr Pro Arg Ser Ala Thr Leu Asn Gin Val Lys 
35 980 985 990 

Leu Val Lys 
995 

40 (2) INFORMATION FOR SEQ ID NO: 4: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3063 base pairs 

(B) TYPE: nucleic acid 
45 tC) STRAND EDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: CDNA 



50 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA 60 

5 GTAAGATGAT TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 120 

TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG TCTTGCATGC 180 

CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA 240 

10 ATTCGAATGG ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGTACTCACT 300 

TTTCTAACTT ATCCAAAACA ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT 360 

15 GCTCCTTATG TGCAGACAAC ATTGAAGGAA AGACATTTGT TTCAACAGTA AATTCTTTAG 420 

TTTTTCAACA AATAGATGCA AACTGGAACA TACAGTGCTG GCTAAAAGGA GACTTAAAAT 480 

TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT CAGGAATTAT AACTATAAGG 540 

20 

TCCATCTTTT ATATGTTCTG CCTGAAGTGT TAGAAGATTC ACCTCTGGTT CCCCAAAAAG €00 

GCAGTTTTCA GATGGTTCAC TGCAATTGCA GTGTTCATGA ATGTTGTGAA TGTCTTGTGC 660 

25 CTGTGCCAAC AGCCAAACTC AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 720 

GAGTAATTTT CCAGTCACCT CTAATGTCAG TTCAGCCCAT AAATATGGTG AAGCCTGATC 780 

CACCATTAGG TTTGCATAT/G GAAATCACAG ATGATGGTAA TTTAAAGATT TCTTGGTCCA 840 

30 

GCCCACCATT GGTACCATTT CCACTTCAAT ATCAAGTGAA ATATTCAGAG AATTCTACAA 900 

CAGTTATCAG AGAAGCTGAC AAGATTGTCT CAGCTACATC CCTGCTAGTA GACAGTATAC 960 

35 TTCCTGGGTC TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 1020 

GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC TTTCCACCTA 1080 

AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG CATCTATAAG AAGGAAAACA 1140 

40 

AGATTGTTCC CTCAAAAGAG ATTGTTTGGT GGATGAATTT AGCTGAGAAA ATTCCTCAAA 1200 

GCCAGTATGA TGTTGTGAGT GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 1260 

45 CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 1320 

ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA TGTGAAACTG 1380 

ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG 1440 

50 „ _ 

AAAGCACTTT GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 1500 

TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA 1560 
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TTTTCCAGCC AATCTTCCTA TTATCTGGCT 
TAGGTTCACT TGACTCTCCA CCAACATGTG 

5 

CTCCATCCAG TGTGAAAGCA GAAATTACTA 
AAAAGCCAGT CTTTCCAGAG AATAACCTTC 
10 AAGAAGTACA ATGGAAGATG TATGAGGTTT 
CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 
TGGGATATTG GAGTAATTGG AGCAATCCAG 

15 

CTATGAGAGG ACCTGAATTT TGGAGAATAA 
ATGTCACTTT ACTTTGGAAG CCCCTGATGA 
20 ATGTGATAAA CCATCATACT TCCTGCAATG 
CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 
CAATTGGTGC TTCTGTTGCA AATTTTAATT 

25 

ATATCGTGCA GTCACTCAGT GCTTATCCTT 
TACTATCACC CAGTGATTAC AAGCTAATGT 
30 AAGATGGTGA AATAAAATGG CTTAGAATCT 
ATCATTTTAT CCCCATTGAG AAGTACCAGT 
TGGGAAAACC AAAGATAATT AATAGTTTCA 

35 

ATGCAGGTTT ATATGTAATT GTGCCAGTAA 
CATTATTAAT ATCACACCAA AGAATGAAAA 
40 AGAATTGTTC CTGGGCACAA GGACTTAATT 
CAAGCATACA GCATCAGTGA CATGTGGTCC 
AGATATCAGT GTTGATACAT CATGGAAAAA 

45 

CTCTCTACTT TCAACAACAG ATCTTGAAAA 
CAGTGTTAAC TTCTCTGAGG CTGAGGGTAC 
50 ACAACCCTTT GTTAAATACG CCACGCTGAT 
AGA 
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ACACAATGTG GATTAGGATC AATCACTCTC 1620 

TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 1680 

TAAACATTGG ATTATTGAAA ATATCTTGGG 1740 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA 1800 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC 1860 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 1920 

CCTACACAGT TGTCATGGAT ATAAAAGTTC 1980 

TTAATGGAGA TACTATGAAA AAGGAGAAAA 2040 

AAAATGACTC ATTGTGCAGT GTTCAGAGAT 2100 

GAACATGGTC AGAAGATGTG GGAAATCACA 2160 

CACATACTGT TACGGTTCTG GCCATCAATT 2220 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA 2280 

TAAACAGCAG TTGTGTGATT GTTTCCTGGA 2340 

ATTTTATTAT TGAGTGGAAA AATCTTAATG 2400 

CTTCATCTGT TAAGAAGTAT TATATCCATG 2460 

TCAGTCTT7A CCCAATATTT ATGGAAGGAG 2520 

CTCAAGATGA TATTGAAAAA CACCAGAGTG 2580 

TTATTTCCTC TTCCATCTTA TTGCTTGGAA 2640 

AGCTATTTTG GGAAGATGTT CCGAACCCCA 2700 

TTCAGAAGAA ACGTTTGAGC ATCTTTTTAT 2760 

TCTTCTTTTG GAGCCTGAAA CAATTTCAGA 2820 

TAAAGATGAG ATGATGCCAA CAACTGTGGT 2880 

GGGTTCTGTT TGTTTTAGTG ACCAGTTCAA 2940 

TGAGGTAACC TATGAGGACG AAAGCCAGAG 3000 

CAGCAACTCT AAACCAAGTG AAACTGGTGA 3060 

3063 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii» MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
I 5 10 15 

Tyr Val He Thr Ala Phe Aan Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
€5 10 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

65 90 95 

Arg Aan Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 

Gin Lya Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 ISO 

Cys Cya Glu Cys Leu val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 \ 205 



\ 
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Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro lie Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys lie Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 



15 



Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 
275 280 285 



Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
290 295 300 



20 



Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 



25 



Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Ptae 

325 330 335 

Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 



30 



He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 



Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 



35 



Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 



40 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn He 

420 425 430 



45 



Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 



Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 4S5 460 



50 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser He His 
465 470 475 460 



97/25424 



PCT/DS97/00128 



- 108 - 



Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

lie Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu lie Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 
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10 



25 



40 



50 
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Tyr Phe lie lie Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu lie Lys 
770 775 780 

Trp Leu Arg lie Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
765 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 

805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 



He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
15 835 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
B50 855 660 

20 Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 

865 870 875 880 



Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Met Leu Glu Gly Ser 

865 890 895 

Met Phe Val Lys Ser His His His Ser Leu He Ser Ser Thr Gin Gly 
900 905 910 



His Lys His Cys Gly Arg Pro Gin Gly Pro Leu His Arg Lys Thr Arg 
30 915 920 925 

Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Leu Pro Pro Leu Leu Ser 
930 935 940 

35 Tyr Asp Pro Ala Lys Ser Pro Ser Val Arg Asn Thr Gin Glu Ser He 

945 950 955 960 



Lys Lys Lys Lys Lys Lys Leu Glu Gly 

965 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



\ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met He Cya Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
1 5 10 15 

Tyr Val He Thr Ala Phe Asn Leu Sor Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 B0 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 90 9* 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
1B0 185 190 

Cys Cya Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 
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Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 

275 260 265 

Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 

290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly lie Trp Ser 

305 310 315 320 



10 



Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 

325 330 335 



15 



Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 



20 



Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 



Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 



25 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cya Asn Glu His 

405 410 415 



30 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 



35 



Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser Zle His 
465 470 475 480 



40 



Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 



45 



Glu Cys Zle Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 



50 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 



Ala Glu Zle Thr He Asn tie Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 5S5 560 
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Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 6" 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr val He Asn His His Thr Ser Cys Asn 
675 6B0 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
** 720 



705 



Gly 



710 



715 



Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 



725 



730 



735 



Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 

805 810 815 

Glu Gly val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 
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He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
635 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
5 650 855 860 

Gin Arg Ket Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
665 870 875 860 

10 Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Met Leu Glu Gly Ser 

885 890 895 



15 



35 



Met Phe Val Lys Ser His His His Ser Leu He Ser Ser Thr Gin Gly 
900 905 910 

His Lys His Cys Gly Arg Pro Gin Gly Pro Leu His Arg Lys Thr Arg 
915 920 925 



Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Leu Pro Pro Leu Leu Ser 
20 930 935 940 

Tyr Asp Pro Ala Lys Ser Pro Ser Val Arg Asn Thr Gin Glu Ser He 
945 950 955 960 

25 Lys Lys Lys Lys Lys Lys Leu Glu Gly 

965 

(2) INFORMATION FOR SEQ ID NO: 7: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
15 10 15 

45 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
50 35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 
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Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 50 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp Ala Asn Trp Asn 
115 120 125 

lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
US ISO 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
100 185 190 

Cvs Cvs Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
Y 195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 

325 330 335 
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Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu A3n Lys He Val Pro Ser Lys Glu lie Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 



10 



Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 



15 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 



20 



Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 



Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 



25 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 
465 4*70 475 480 



30 



Pro lie Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 



35 



He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 



40 



Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 



45 



Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 



50 



Ser Lys Ser val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 



Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 
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Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp lie Lya Val Pro Met 
625 €30 635 640 

Arg Gly Pro Glu Phe Trp Arg He lie Asn Gly Asp Thr Met Lys Lya 

645 650 655 

Glu Lya Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lya Aan Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Aan Hia His Thr Ser Cys Aan 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lya 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 600 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 

B05 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 

He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
850 855 860 

Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 B80 

Cvs Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Pro Glu Thr Phe Glu 

885 890 895 
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His Leu Phe He Lys His Thr Ala Ser Val Thr Cys Gly Pro Leu Leu 
900 905 910 

Leu Glu Pro Glu Thr He Ser Glu Asp He Ser Val Asp Thr Ser Trp 
915 920 925 

Lys Asn Lys Asp Glu Met Met Pro Thr Thr Val Val Ser Leu Leu Ser 
930 935 940 

Thr Thr Asp Leu Glu Lys Gly Ser Val Cys He Ser Asp Gin Phe Asn 
945 950 955 960 

Ser Val Asn Phe Ser Glu Ala Glu Gly Thr Glu Val Thr Tyr Glu Asp 

965 970 975 

Glu Ser Gin Arg Gin Pro Phe Val Lys Tyr Ala Thr Leu He Ser Asn 
980 985 990 

Ser Lys Pro Ser Glu Thr Gly Glu Glu Gin Gly Leu He Asn Ser Ser 
995 1000 1005 

Val Thr Lys Cys Phe Ser Ser Lys Asn Ser Pro Leu Lys Asp Ser Phe 
1010 1015 1020 

Ser Asn Ser Ser Trp Glu He Glu Ala Gin Ala Phe Phe He Leu Ser 
1025 1030 1035 1040 

Asp Gin His Pro Asn He He Ser Pro His Leu Thr Phe Ser Glu Gly 

1045 1050 1055 

Leu Asp Glu Leu Leu Lys Leu Glu Gly Asn Phe Pro Glu Glu Asn Asn 
1060 1065 1070 

Asp Lys Lys Ser He Tyr Tyr Leu Gly Val Thr Ser He Lys Lys Arg 
1075 1080 1085 

Glu Ser Gly Val Leu Leu Thr Asp Lys Ser Arg Val Ser Cys Pro Phe 
1090 1095 1100 

Pro Ala Pro Cys Leu Phe Thr Asp He Arg Val Leu Gin Asp Ser Cys 
1105 1110 1115 1120 

Ser His Phe Val Glu Asn Asn He Asn Leu Gly Thr Ser Ser Lys Lys 

1125 1130 1135 

Thr Phe Ala Ser Tyr Met Pro Gin Phe Gin Thr Cys Ser Thr Gin Thr 
1140 1145 1150 

His Lys He Met Glu Asn Lys Met Cys Asp Leu Thr Val Phe His Arg 
1155 1160 1165 



Asn Leu Gin He Cys Val He Met Gly Asn He Lys Cys Asn Arg Leu 
1170 1175 HBO 



10 



25 



30 



35 
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Leu Trp Val Gly Glu Arg Lys Glu Thr Arg Val Lys Phe Glu Asn Asn 
U85 1190 1195 1200 

Cys Ser Lys Lys Lys Lys Lys Lys Asn Ser Arg Pro Ala Arg Pro Asp 

1205 1210 1215 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3599 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

40 TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 

AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

45 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

50 TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 
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ATCAAGTGAA ATATTCAGAG 
CAGCTACATC CCTGCTAGTA 
5 GGGGCAAGAG ACTGGATGGC 
CCACACAAGA TGTCATATAC 
CTTTTCACTG CATCTATAAG 

10 

GGATGAATTT AGCTGAGAAA 
GCAAAGTTAC TTTTTTCAAT 
15 CAGTGTACTG CTGCAATGAA 
ATGTCAATAT CAATATCTCA 
GGTCAACCAG TACAATCCAG 

20 

GCAGCCTTTA CTGTTCTGAT 
ATTTGCAGAG TGATGGTTTT 
25 ACACAATGTG GATTAGGATC 
TCCTTCCTGA TTCTGTGGTG 
TAAACATTGG ATTATTGAAA 

30 

AATTCCAGAT TCGCTATGGT 
ATGATGCAAA ATCAAAATCT 
35 TTCAGGTGCG CTGTAAGAGG 
CCTACACAGT TGTCATGGAT 
TTAATGGAGA TACTATGAAA 

40 

AAAATGACTC ATTGTGCAGT 
GAACATGGTC AGAAGATGTG 
45 CACATACTGT TACGGTTCTG 
TAACCTTTTC ATGGCCTATG 
TAAACAGCAG TTGTGTGATT 

50 

ATTTTATTAT TGAGTGGAAA 
CTTCATCTGT TAAGAAGTAT 
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AATTCTACAA CAGTTATCAG AGAAGCTGAC 
GACAGTATAC TTCCTGGGTC TTCGTATGAG 
CCAGGAATCT GGAGTGACTG GAGTACTCCT 
TTTCCACCTA AAATTCTGAC AAGTGTTGGG 
AAGGAAAACA AGATTGTTCC CTCAAAAGAG 
ATTCCTCAAA GCCAGTATGA TGTTGTGAGT 
CTGAATGAAA CCAAACCTCG AGGAAAGTTT 
CATGAATGCC ATCATCGCTA TGCTGAATTA 
TGTGAAACTG ATGGGTACTT AACTAAAATG 
TCACTTGCGG AAAGCACTTT GCAATTGAGG 
ATTCCATCTA TTCATCCCAT ATCTGAGCCC 
TATGAATGCA TTTTCCAGCC AATCTTCCTA 
AATCACTCTC TAGGTTCACT TGACTCTCCA 
AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
ATATCTTGGG AAAAGCCAGT CTTTCCAGAG 
TTAAGTGGAA AAGAAGTACA ATGGAAGATG 
GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA 
CTAGATGGAC TGGGATATTG GAGTAATTGG 
ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
AAGGAGAAAA ATGTCACTTT ACTTTGGAAG 
GTTCAGAGAT ATGTGATAAA CCATCATACT 
GGAAATCACA CGAAATTCAC TTTCCTGTGG 
GCCATCAATT CAATTGGTGC TTCTGTTGCA 
AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
GTTTCCTGGA TACTATCACC CAGTGATTAC 
AATCTTAATG AAGATGGTGA AATAAAATGG 
TATATCCATG ATCATTTTAT CCCCATTGAG 



PCT/US97/00128 

AAGATTGTCT 960 

GTTCAGGTGA 1020 

CGTGTCTTTA 1080 

TCTAATGTTT 1140 

ATTGTTTGGT 1200 

GATCATGTTA 1260 

ACCTATGATG 1320 

TATGTGATTG 1380 

ACTTGCAGAT 1440 

TATCATAGGA 1500 

AAAGATTGCT 1560 

TTATCTGGCT 1620 

CCAACATGTG 1680 

GAAATTACTA 1740 

AATAACCTTC 1800 

TATGAGGTTT 1860 

GTCTATGCTG 1920 

AGCAATCCAG 1980 

TGGAGAATAA 2040 

CCCCTGATGA 2100 

TCCTGCAATG 2160 

ACAGAGCAAG 2220 

AATTTTAATT 2280 

GCTTATCCTT 2340 

AAGCTAATGT 2400 

CTTAGAATCT 2460 

AAGTACCAGT 2520 
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TCAGTCTTTA CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 2580 

CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT GTGCCAGTAA 2640 

TTATTTCCTC TTCCATCTTA TTGCTTGGAA CATTATTAAT ATCACACCAA AGAATGAAAA 2700 

AGCTATTTTG GGAAGATGTT CCGAACCCCA AGAATTGTTC CTGGGCACAA GGACTTAATT 2760 

TTCAGAAGCC AGAAACGTTT GAGCATCTTT TTATCAAGCA TACAGCATCA GTGACATGTG 2820 

GTCCTCTTCT TTTGGAGCCT GAAACAATTT CAGAAGATAT CAGTGTTGAT ACATCATGGA 2880 

AAAATAAAGA TGAGATGATG CCAACAACTG TGGTCTCTCT ACTTTCAACA ACAGATCTTG 2940 

AAAAGGGTTC TGTTTGTATT AGTGACCAGT TCAACAGTGT TAACTTCTCT GAGGCTGAGG 3000 

GTACTGAGGT AACCTATGAG GACGAAAGCC AGAGACAACC CTTTGTTAAA TACGCCACGC 3060 

TGATCAGCAA CTCTAAACCA AGTGAAACTG GTGAAGAACA AGGGCTTATA AATAGTTCAG 3120 

TCACCAAGTG CTTCTCTAGC AAAAATTCTC CGTTGAAGGA TTCTTTCTCT AATAGCTCAT 31B0 

GGGAGATAGA GGCCCAGGCA TTTTTTATAT TATCGGATCA GCATCCCAAC ATAATTTCAC 3240 

CACACCTCAC ATTCTCAGAA GGATTGGATG AACTTTTGAA ATTGGAGGGA AATTTCCCTG 3300 

AAGAAAATAA TGATAAAAAG TCTATCTATT ATTTAGGGGT CACCTCAATC AAAAAGAGAG 3360 

AGAGTGGTGT GCTTTTGACT GACAAGTCAA GGGTATCGTG CCCATTCCCA GCCCCCTGTT 3420 

TATTCACGGA CATCAGAGTT CTCCAGGACA GTTGCTCACA CTTTGTAGAA AATAATATCA 3480 

ACTTAGGAAC TTCTAGTAAG AAGACTTTTG CATCTTACAT GCCTCAATTC CAAACTTGTT 3540 

CTACTCAGAC TCATAAGATC ATGGAAAACA AGATGTGTGA CCTAACTGTG TAATCTAGA 3599 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 
<A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi> SEQUENCE DESCRIPTION; SEQ ID NO:9: 
NNNNNTACCT TTTCCAG 17 
5 (2) INFORMATION FOR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



25 



40 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

20 Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 

15 10 15 



Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro lie Thr Pro Trp Arg 
20 25 30 

Phe Lya Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 



Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
30 50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

35 Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 90 95 



Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 

100 105 110 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 



He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
45 130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 

145 150 155 160 

50 Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 
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Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cya Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys lie Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu val Pro Phe Pro Leu Gin Tyr Gin Val Ly3 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 2B0 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 

325 330 335 

Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala val Tyr Cys Cys Asn Glu His 

405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 
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Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser He His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

4B5 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 



15 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 



Ala Glu He Thr He Asn lie Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 



20 



Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 



25 



Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr A3p Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 



30 



Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 



Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 



35 



Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 



40 



Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 6B0 665 



45 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 



50 



Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser lie 

705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 
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Lys Val Asn He Val Gin Ser Leu Ser Ma Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
765 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 

B05 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin A3p Asp 
820 825 830 

He Glu Lys His Gin Ser Asp 
835 

(2) INFORMATION FOR SEQ ID NO: 11: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2624 base pairs 

(B) TYPE: nucleic acid 
{C) STRAND EDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 
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AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

5 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

10 TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCA7T GGTACCATTT CCACTTCAAT 900 

ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

15 

CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGGGCAAGAG ACTGGATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

20 CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

25 

GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

CAGTGTACTG CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 1380 

30 ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT 1440 

GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 

35 

ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 

ACACAATGTG GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

40 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA 1740 

TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG TATGAGGTTT 1860 

45 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

50 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 

TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 
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AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT TCCTGCAATG 2160 

GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 

CACATACTGT TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 

TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 2400 

ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG CTTAGAATCT 2460 

CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT 2520 

TCAGTCTTTA CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 2580 
CTCAAGATGA TATTGAAAAA CACCAGAGTG ATTGATAAGG ATCC 
(2> INFORMATION FOR SEQ ID NO: 12: 

(i» SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2948 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



2624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CCATTGAAGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGGATTT 


CCAAAATGTC 


60 


GTAATAACCC CGCCCCGTTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG 


GAGGTCTATA 


120 


TAAGCAGAGC TCGTTTAGTG AACCGTCAGA TCTCTAGAAG CTGGGTACCA 


GCTGCTAGCA 


180 


AGCTTGCTAG CGGCCGCCAG TGTGATGGAT ATCTGCAGAA TTCGGCTTTC 


TCTGCCTTCG 


240 


GTCGAGTTGG ACCCCCGGAT CAAGGTGTAC TTCTCTGAAG TAAGATGATT 


TGTCAAAAAT 


300 


TCTGTGTGGT TTTGTTACAT TGGGAATTTA TTTATGTGAT AACTGCGTTT 


AACTTGTCAT 


360 


ATCCAATTAC TCCTTGGAGA TTTAAGTTGT CTTGCATGCC ACCAAATTCA 


ACCTATGACT 


420 


ACTTCCTTTT GCCTGCTGGG CTCTCAAAGA ATACTTCAAA TTCGAATGGA 


CATTATGAGA 


480 


CAGCTGTTGA ACCTAAGTTT AATTCAAGTG GTACTCACTT TTCTAACTTA 


TCCAAAACAA 


540 


CTTTCCACTG TTGCTTTCGG AGTGAGCAAG ATAGAAACTG CTCCTTATGT 


GCAGACAACA 


600 
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TTGAAGGAAA GACATTTGTT TCAACAGTAA 
ACTGGAACAT ACAGTGCTGG CTAAAAGGAG 
5 CATTATTTAA GAATCTATTC AGGAATTATA 
CTGAAGTGTT AGAAGATTCA CCTCTGGTTC 
GCAATTGCAG TGTTCACGAA TGTTGTGAAT 

10 

ACGACACTCT CCTTATGTGT TTGAAAATCA 
TAATGTCAGT TCAGCCCATA AATATGGTGA 
15 AAATCACAGA TGATGGTAAT TTAAAGATTT 
CACTTCAATA TCAAGTGAAA TATTCAGAGA 
AGATTGTCTC AGCTACATCC CTGCTAGTAG 

20 

TTCAGGTGAG GGGCAAGAGA CTGGATGGCC 
GTGTCTTTAC CACACAAGAT GTCATATACT 
25 CTAATGTTTC TTTTCACTGC ATCTATAAGA 
TTGTTTGGTG GATGAATTTA GCTGAGAAAA 
ATCATGTTAG CAAAGTTACT TTTTTCAATC 

30 

CCTATGATGC AGTGTACTGC TGCAATGAAC 
ATGTGATTGA TGTCAATATC AATATCTCAT 
35 CTTGCAGATG GTCAACCAGT ACAATCCAGT 
ATCATAGGAG CAGCCTTTAC TGTTCTGATA 
AAGATTGCTA TTTGCAGAGT GATGGTTTTT 

40 

TATCTGGCTA CACAATGTGG ATTAGGATCA 
CAACATGTGT CCTTCCTGAT TCTGTGGTGA 
45 AAATTACTAT AAACATTGGA TTATTGAAAA 
ATAACCTTCA ATTCCAGATT CGCTATGGTT 
ATGAGGTTTA TGATGCAAAA TCAAAATCTG 

50 

TCTATGCTGT TCAGGTGCGC TGTAAGAGGC 
GCAATCCAGC CTACACAGTT GTCATGGATA 



127 - 

ATTCTTTAGT TTTTCAACAA ATAGATGCAA 660 

ACTTAAAATT ATTCATCTGT TATGTGGAGT 720 

ACTATAAGGT CCATCTTTTA TATGTTCTGC 780 

CCCAAAAAGG CAGTTTTCAG ATGGTTCACT 640 

GTCTTGTGCC TGTGCCAACA GCCAAACTCA 900 

CATCTGGTGG AGTAATTTTC CAGTCACCTC 960 

AGCCTGATCC ACCATTAGGT TTGCATATGG 1020 

CTTGGTCCAG CCCACCATTG GTACCATTTC 1080 

ATTCTACAAC AGTTATCAGA GAAGCTGACA 1140 

ACAGTATACT TCCTGGGTCT TCGTATGAGG 1200 

CAGGAATCTG GAGTGACTGG AGTACTCCTC 1260 

TTCCACCTAA AATTCTGACA AGTGTTGGGT 1320 

AGGAAAACAA GATTGTTCCC TCAAAAGAGA 1380 

TTCCTCAAAG CCAGTATGAT GTTG1GAGTG 1440 

TGAATGAAAC CAAACCTCGA GGAAAGTTTA 1500 

ATGAATGCCA TCATCGCTAT GCTGAATTAT 1560 

GTGAAACTGA TGGGTACTTA ACTAAAATGA 1620 

CACTTGCGGA AAGCACTTTG CAATTGAGGT 1660 

TTCCATCTAT TCATCCCATA TCTGAGCCCA 1740 

ATGAATGCAT TTTCCAGCCA ATCTTCCTAT 1800 

ATCACTCTCT AGGTTCACTT GACTCTCCAC 1860 

AGCCACTGCC TCCATCCAGT GTGAAAGCAG 1920 

TATCTTGGGA AAAGCCAGTC TTTCCAGAGA 1980 

TAAGTGGAAA AGAAGTACAA TGGAAGATGT 2040 

TCAGTCTCCC AGTTCCAGAC TTGTGTGCAG 2100 

TAGATGGACT GGGATATTGG AGTAATTGGA 2160 

TAAAAGTTCC TATGAGAGGA CCTGAATTTT 2220 
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GGAGAATAAT 


TAATGGAGAT ACTATGAAAA AGGAGAAAAA 


TGTCACTTTA 


CTTTGGAAGC 


2280 


CCCTGATGAA 


AAATGACTCA TTGTGCAGTG TTCAGAGATA 


TGTGATAAAC 


CATCATACTT 


2340 


CCTGCAATGG AACATGGTCA GAAGATGTGG GAAATCACAC 


GAAATTCACT 


TTCCTGTGGA 


2400 


CAGAGCAAGC 


ACATACTGTT ACGGTTCTGG CCATCAATTC 


AATTGGTGCT 


TCTGTTGCAA 


2460 


ATTTTAATTT 


AACCTTTTCA TGGCCTATGA GCAAAGTAAA 


TATCGTGCAG 


TCACTCAGTG 


2520 


CTTATCCTTT 


AAACAGCAGT TGTGTGATTG TTTCCTGGAT 


ACTATCACCC 


AGTGATTACA 


25B0 


AGCTAATGTA 


TTTTATTATT GAGTGGAAAA ATCTTAATGA AGATGGTGAA ATAAAATGGC 


2640 


TTAGAATCTC 


TTCATCTGTT AAGAAGTATT ATATCCATGA 


TCATTTTATC 


CCCATTGAGA 


2700 


AGTACCAGTT 


CAGTCTTTAC CCAATATTTA TGGAAGGAGT 


GGGAAAACCA 


AAGATAATXA 


2760 


ATAGTTTCAC 


TCAAGATGAT ATTGAAAAAC ACCAGAGTGA 


TGCAGGTGAC 


TACAAGGACG 


2820 


ACGATGACAA 


GTAGGGATCC AGACATGATA AGATACATTG 


ATGAGTTTGG 


ACAACCCACA 


28B0 


ACTAGAATGC 


AGTGAAAAAA ATGCTTTATT TGTGAAATTT 


GTGATGCTAT 


TGCTTTATTT 


2940 


GTAACCAT 








294B 



<2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
<D> TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
15 io 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cya Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 
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Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 90 95 

Arg Asn Cys Ser. Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 110 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 



15 



He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 



Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 



20 



Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 



25 



Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 



30 



Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 



Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 



35 



Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 

245 250 255 



40 



Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 



45 



Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 



Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 



50 



Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 

325 330 335 
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Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 360 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
43S 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser He His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 
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25 



45 



Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val Zle Asn His His Thr Ser Cys Asn 
675 680 685 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
15 690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

20 Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 



Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 

755 760 765 



Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
30 770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Gly Lys 
785 790 795 800 

35 Phe Thr He Leu 



(2) INFORMATION FOR SEQ ID HO: 14: 

40 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2507 base pairs 
[B> TYPE: nucleic acid 
tC) STRAND EDNESS : single 
(D) TOPOLOGY: linear 



(il) MOLECULE TYPE: cDNA 



50 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 

AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 

ATCAAGTGAA AT AT TC AG AG AATTCTACAA CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGGGCAAGAG ACTGGATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

CAGTGTACTG CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 1380 

ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT 1440 

GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 
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10 



15 



20 



25 



30 



35 



40 



45 



50 
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ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 

ACACAATGTG GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA 1740 

TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG TATGAGGTTT 1860 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 

TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 

AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT TCCTGCAATG 2160 

GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 

CACATACTGT TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 

TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 2400 

ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG CTTAGAATCT 2460 

CTTCATCTGT TAAGAAGTAT TATATCCATG GTAAGTTTAC TATACTT 2507 
(2) INFORMATION FOR SEQ ID NO; 15: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTAAGTTATT TGNNNNNATA TCCTAACAG 29 
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12) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTAAGCATTA GCNNMNNTTT TAAATTCAG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTAAGTACCA AANNNNNTTT TCAATATAG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO:18: 
GTAAGTTATG CANNNNNTTT TTCCTTAAG 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTAAGTATAT TTNNNNAATA TTTAACAG 28 
(2) INFORMATION FOR SEQ ID NO: 20: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GTAGGTTATG TANNNNNCCC TCATTACAG 29 

35 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
(B> TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
50 GTAAGAAAAC AGNNNNNTGT TTCAAATAG 



29 
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(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GTACGTATTA TTNNNNNTAT CTTTTAAAG 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTATGTCAAG CTNNNNNAAA AATTTCTAG 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTACCTTTTA CTNNNNNCTT ATTTTACAG 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



GTCTGCAGAG ATNNNNNGTC ATTTTGCAG 



29 



(2) INFORMATION FOR SEQ ID NO: 26: 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



30 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTATTCCCAA TTNNNNNTAT TTACTACAG 29 
35 (2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
40 (C> STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
50 GTATTCCCAA TTNNNNNTAT TTACTACAG 



29 
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(2) INFORMATION FOR SEQ ID NO:2B: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

10 



(ati) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

15 

GTAAGTTTAC TANNNNNTTT TCTCCTCAG 
(2> INFORMATION FOR SEQ ID NO: 29: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: CDNA 



3C * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GTAAAAATTA TANNNNNTTT CTTTTTCAG 29 
35 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
40 {C> STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
50 GTATTGTACT TGNNNNNTAT CCTTTGTAG 



29 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTTGCTTTTT CANNNNNTTA TCTAAACAG 29 
(2) INFORMATION FOR SEQ ID NO: 32: 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTACATTTGT CTNNNNNCTT TTCTTTTAG 29 
35 (2) INFORMATION FOR SEQ ID NO: 33: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
50 GTATOCAGTG TTNNNNNCTT TTTAAACAG 29 



\ 
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CLAIMS 



10 



1. An OB receptor protein preparation 
containing an OB receptor protein, optionally in a 
pharmaceutical ly acceptable formulation, said OB 
receptor protein having part or all of the amino acid 
sequence according to Seq. ID No. 1 and one or more of 
the biological properties of naturally occurring OB 
receptor protein. 



2. An OB receptor protein preparation 
containing an OB receptor protein, optionally in a 
pharmaceutically acceptable formulation, wherein said OB 
15 receptor protein amino acid sequence is selected from 

among amino acid sequences (according to Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896 optionally with an N-terminal 

methionyl residue; 
20 (c) 23-896 optionally with an N-terminal 

methionyl residue; 

(d) 29-896 optionally with an N-terminal 

methionyl residue; 

(e) 1-839; 

25 (f) 22-839 optionally with an N-terminal 

methionyl residue; 

<g) 29-839 optionally with an N-terminal 

methionyl residue; 

(h) 1-841; 

3 q (i) 22-841 optionally with an N-terminal 

methionyl residue; 

(j) 23-841 optionally with an N-terminal 

methionyl residue; 

(k) 29-841 optionally with an N-terminal 

35 methionyl residue; 

(1) 1-891; 
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(m) 22-891 optionally with an N-terminal 
methionyl residue; 

<n) 23-891 optionally with an N-terminal 
methionyl residue; 
5 (o) 29-891 optionally with an N-terminal 

methionyl residue; 

(p) of subparts (1) through (o) further 
having the C-terminal amino acids, beginning at position 
892, of OB receptor B (Seq. ID No. 3) or C (Seq. ID. No. 
10 5); and, 

(q) a chemically modified derivative of 
any of subparts (a) through (p) . 



3. An OB receptor protein preparation of 
15 claim 2 wherein said OB receptor protein is further 

selected from among the OB receptor proteins of subparts 
(1) through (o) further having the C-terminal amino 
acids, beginning at position 892, of OB receptor protein 
D (Seq. ID No. 1) . 

20 

4. An OB receptor protein preparation of 
claim 2 wherein said OB receptor protein is further 
selected from among the OB receptor proteins of subparts 
(1) through (o) further having substituted the C- 

25 terminal amino acids, beginning at position 799, G K F T 
I L (Seq. ID No. 13) . 
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5. An OB receptor protein preparation 
according to any of claims 1 through 4, wherein the 
extracellular domain of said OB receptor protein is 
5 modified, said modification selected from among: 

(a) deletion of all or part of the 

random coil domain; 

(b) modification of one or both ^WSXWS" 
boxes by substition of the first serine with another 

10 amino acid; 

(c) modification of one or both *WSXWS" 
boxes by substitution of the last serine with another 
amino acid; and 

(d) modification of one or both W WSXWS" 
15 boxes by substitution of the first tryptophan with 

another amino acid. 

6. A DNA molecule encoding an OB receptor 
protein according to any of claims 1-5 selected from the 

group consisting of: 

(a) the DNA sequences set forth in Seq. 

ID nos. 2, 4, 6, 8, 11, 12, and 14; 

(b) a DNA which selectively hybridizes 

to a DNA of subpart (a) ; and 

(c) a DNA which, but for the degeneracy 
of the genetic code would hybridize to a DNA of subpart 
(a) or (b) . 

7. A biologically functional viral or 
30 plasmid vector containing a DNA of claim 6. 

8. A procaryotic or eucaryotic host cell 
containing the vector of claim 7, 



20 



25 



35 



9. A host cell modified so that expression 
of endogenous OB receptor protein is enhanced. 
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10. A host cell of claim 9 which is an 
isolated human host cell . 

5 11. A process for producing an OB receptor 

protein comprised of culturing, under suitable 
conditions/ a host cell according to any of claims 8, 9 
or 10, obtaining the OB receptor produced, and 
optionally preparing a pharmaceutical composition 
10 containing said OB receptor. 

12. A method of treating an individual for a 
therapeutic disorder selected from among obesity, 
diabetes, high blood lipid levels, and high cholesterol 
15 levels comprised of administering a therapeutic amount 
of an OB receptor protein preparation containing an OB 
receptor protein according to any of claims 1-5, or 
produced by the process according to claim 11. 

20 13. A method of treating an individual for 

weight loss or weight maintenance for solely cosmetic 
purposes comprised of administering an effective amount 
of an OB receptor preparation containing an OB receptor 
protein according to any of claims 1-5, or produced by 

25 the process according to claim 11. 

14. Use of an OB receptor protein according 
to claims 1-5, or produced by the process of claim 11, 
for manufacturing a medicament for the treatment of 
30 obesity, diabetes, high blood lipid levels, or high 
cholesterol levels. 
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15. An OB protein/OB receptor protein complex 
preparation, containing an OB protein moiety and an OB 
receptor protein moiety, optionally in a pharmaceuti- 

5 cally acceptable formulation, wherein: 

(a) said OB receptor protein is 
selected from among those set forth in any of claims 1 
and 2; 

(b) said OB protein moiety is selected 

10 from among: 

(i) a naturally ocurring OB 

protein; and, 

(ii) a non-naturally ocurring OB 
protein, analog or derivative thereof. 

15 

16. An OB protein/OB receptor protein complex 
preparation of claim 15 wherein said OB receptor protein 
is selected from among those set forth in any of claims 
3, 4, and 5. 

20 

17. A method of treating an individual for a 
therapeutic disorder selected from among obesity, 
diabetes, high blood lipid levels, and high cholesterol 
levels comprised of administering a therapeutic amount 

25 of an OB protein/OB receptor protein complex preparation 
of claims 15 or 16. 

18. A method of claim 17 wherein said OB 
protein/OB receptor protein complex preparation is 

30 formed in vivo by administering, into a patient, a first 
population of cells expressing an OB protein, and a 
second population of cells expressing an OB receptor 
protein. 
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19. A method of treating an individual for 
weight loss or weight maintenance for solely cosmetic 
purposes comprised of administering a therapeutic amount 
5 of an OB protein/OB receptor protein complex preparation 
containing an OB receptor protein moiety according to 
any of claims 1-5, or produced by the process according 
to claim 11. 

10 20. Use of an OB protein/OB receptor protein 

complex preparation, according to claims 15 or 16, for 
manufacturing a medicament for the treatment of obesity, 
diabetes, high blood lipid levels, or high cholesterol 
levels . 
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